AI Practitioner Definitions and Services Cheat Sheet

Category Name Definition
Benefits of Cloud Computing Elasticity Elasticity refers to the ability to automatically scale resources up or down in response to changing demand, ensuring optimal performance and cost-efficiency.
Global Reach Global reach is the ability to deploy applications and services in data centers around the world, providing low-latency access and consistent user experiences regardless of geographic location.
Pay-As-You-Go Pricing Pay-as-you-go pricing means you pay only for the resources and services you use, without upfront costs or long-term commitments.
Agility Agility refers to the ability to rapidly develop, test, and deploy applications, enabling faster innovation and adaptation to changing business needs.
Fault Tolerance Fault tolerance is the capability of a system to continue operating properly even when one or more components fail.
Scalability Scalability refers to the ability to handle increasing or decreasing workloads by dynamically adjusting resources without affecting performance.
Types of Cloud Computing Public Cloud Public Cloud is a cloud computing model where services are delivered over the internet and shared across multiple organizations.
Private Cloud Private Cloud is a cloud environment dedicated to a single organization, offering greater control and security.
Hybrid Cloud Hybrid Cloud combines public and private clouds to enable data and application sharing between them for greater flexibility.
Multi Cloud Multi-Cloud involves using services from multiple cloud providers to avoid vendor lock-in and optimize performance or cost.
Machine Learning Methods Classification A supervised learning task that assigns inputs into predefined categories or classes.
Regression A supervised learning task that predicts continuous numerical values based on input data.
Clustering An unsupervised learning technique that groups similar data points together based on their features.
Overfitting A modeling error where a machine learning model learns the training data too well, including noise, resulting in poor generalization.
Underfitting A scenario where a model is too simple to capture the underlying patterns in the data, leading to poor performance on both training and test sets.
Generative AI Evaluation Metrics ROUGE A metric that evaluates the quality of generated summaries by measuring n-gram overlap with reference summaries.
BLEU A metric for assessing machine translation quality based on the overlap of n-grams between the machine output and human reference translations.
GLUE A benchmark suite that evaluates NLP models on a diverse set of natural language understanding tasks.
MMLU A benchmark that measures a model’s performance across a wide range of academic and professional subjects to assess knowledge and reasoning capabilities.
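The n-gram overlap that ROUGE and BLEU are built on can be sketched in a few lines. This is a simplified illustration, not a full implementation of either metric: the function names are hypothetical, tokenization is plain whitespace splitting, and BLEU's brevity penalty and averaging over several n-gram sizes are left out.

```python
from collections import Counter

def ngrams(tokens, n):
    """Count every contiguous n-gram in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def overlap(candidate, reference, n):
    """Return (matched, candidate_total, reference_total) n-gram counts."""
    cand = ngrams(candidate.split(), n)
    ref = ngrams(reference.split(), n)
    # Counter intersection clips each n-gram to the smaller of its two counts
    matched = sum((cand & ref).values())
    return matched, sum(cand.values()), sum(ref.values())

def rouge_n_recall(candidate, reference, n=1):
    """ROUGE-N style recall: share of reference n-grams found in the candidate."""
    matched, _, ref_total = overlap(candidate, reference, n)
    return matched / ref_total if ref_total else 0.0

def ngram_precision(candidate, reference, n=1):
    """BLEU style clipped precision: share of candidate n-grams found in the reference."""
    matched, cand_total, _ = overlap(candidate, reference, n)
    return matched / cand_total if cand_total else 0.0
```

Recall rewards a summary for covering the reference, while precision penalizes output that adds n-grams the reference does not contain, which is roughly why ROUGE is associated with summarization and BLEU with translation.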
Machine Learning Concepts Supervised Learning A machine learning approach where the model is trained on labeled data to predict outcomes.
Unsupervised Learning A machine learning method that finds patterns or structures in data without labeled outcomes.
Reinforcement Learning A learning paradigm where an agent learns to make decisions by interacting with an environment and receiving feedback through rewards or penalties.
Transfer Learning A technique where a model trained on one task is adapted for a related but different task, leveraging prior knowledge.
Retrieval Augmented Generation (RAG) A method that combines information retrieval with text generation, enabling models to generate responses using both pre-trained knowledge and external documents.
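The retrieve-then-generate flow of RAG can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the function names are hypothetical, retrieval is a simple shared-word count (production systems typically use vector embeddings and a vector store), and the final call to a model is left out.

```python
def tokenize(text):
    """Lowercase and split on whitespace; a deliberately crude tokenizer."""
    return set(text.lower().split())

def retrieve(query, documents, top_n=1):
    """Rank documents by how many words they share with the query."""
    query_words = tokenize(query)
    ranked = sorted(
        documents,
        key=lambda doc: len(query_words & tokenize(doc)),
        reverse=True,
    )
    return ranked[:top_n]

def build_rag_prompt(query, documents, top_n=1):
    """Place the retrieved passages in front of the question."""
    context = "\n".join(f"- {doc}" for doc in retrieve(query, documents, top_n))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )
```

The resulting prompt would then be sent to a text generation model, so the answer can draw on documents the model never saw during training.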
Prompt Techniques Few Shot Prompting Providing a few examples to help the LLM understand and generate responses better.
Zero Shot Prompting Providing a prompt with no examples. The model generates responses based on its training.
Prompt Templates Using predefined structures that include instructions, examples, and specific content.
Chain-of-Thought Prompting Prompting the model to break complex tasks into intermediate reasoning steps to improve the accuracy and coherence of its answers.
Prompt Tuning Replacing the prompt text with continuous embeddings optimized during training.
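The difference between zero-shot and few-shot prompting, and the role of a prompt template, can be shown with a small sketch. The `build_prompt` helper and its `Input:`/`Output:` layout are hypothetical choices for illustration, not a format required by any particular model.

```python
def build_prompt(instruction, query, examples=()):
    """Fill a simple template: instruction, optional worked examples, then the query."""
    lines = [instruction.strip(), ""]
    for text, label in examples:
        # Each example pairs an input with the output we want the model to imitate
        lines += [f"Input: {text}", f"Output: {label}", ""]
    lines += [f"Input: {query}", "Output:"]
    return "\n".join(lines)

# Zero-shot: instruction and query only
zero_shot = build_prompt("Classify the sentiment.", "Great product")

# Few-shot: the same template with labeled examples added
few_shot = build_prompt(
    "Classify the sentiment.",
    "Great product",
    examples=[("I love it", "positive"), ("It broke in a day", "negative")],
)
```

Keeping the structure in one function means the only thing that changes between the two techniques is whether `examples` is empty.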
Inference Parameters Temperature A parameter that controls the randomness of a model’s output. Higher values produce more diverse responses, while lower values make outputs more focused and deterministic.
Top K A decoding technique where the model considers only the top K most probable tokens at each step, increasing coherence by limiting choice to the most likely options.
Top P A decoding method where the model selects from the smallest set of tokens whose cumulative probability exceeds P, balancing diversity and relevance dynamically.
Response Length The maximum number of tokens the model is allowed to generate in its output.
Stop Sequences User-defined token sequences that signal the model to stop generating further text when encountered.
Penalties Controls that reduce the likelihood of repeating words or phrases by penalizing tokens already used in the generation.
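How Temperature, Top K, and Top P shape the choice of the next token can be sketched with plain Python. This is a minimal illustration under stated assumptions: the function names are hypothetical, tokens are represented by their index in a list of logits, and real model providers may apply these filters in a different order or with different tie-breaking.

```python
import math
import random

def softmax(logits, temperature=1.0):
    """Turn logits into probabilities; temperature must be greater than 0."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return {i: e / total for i, e in enumerate(exps)}

def _renormalize(probs, keep):
    """Keep only the chosen tokens and rescale so they sum to 1."""
    total = sum(probs[i] for i in keep)
    return {i: probs[i] / total for i in keep}

def top_k_filter(probs, k):
    """Keep the k most probable tokens."""
    ranked = sorted(probs, key=probs.get, reverse=True)
    return _renormalize(probs, ranked[:k])

def top_p_filter(probs, p):
    """Keep the smallest set of tokens whose cumulative probability reaches p."""
    ranked = sorted(probs, key=probs.get, reverse=True)
    keep, cumulative = [], 0.0
    for i in ranked:
        keep.append(i)
        cumulative += probs[i]
        if cumulative >= p:
            break
    return _renormalize(probs, keep)

def sample_token(logits, temperature=1.0, top_k=None, top_p=None, rng=random):
    """Sample one token index after applying the optional filters."""
    probs = softmax(logits, temperature)
    if top_k is not None:
        probs = top_k_filter(probs, top_k)
    if top_p is not None:
        probs = top_p_filter(probs, top_p)
    tokens = list(probs)
    return rng.choices(tokens, weights=[probs[t] for t in tokens], k=1)[0]
```

Dividing the logits by a temperature below 1 sharpens the distribution toward the most likely token, while a temperature above 1 flattens it. Top K keeps a fixed number of candidates, whereas Top P keeps a variable number that depends on how confident the model is at that step.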
Responsible AI Fairness Ensuring AI systems do not produce biased outcomes and treat all individuals or groups equitably.
Explainability The ability to understand and articulate how an AI model makes its decisions or predictions.
Robustness The resilience of an AI system to adversarial inputs, errors, or changes in data without degrading performance.
Privacy Protecting user data and ensuring AI systems do not expose or misuse sensitive personal information.
Governance The policies, processes, and oversight mechanisms put in place to ensure ethical and compliant AI development and use.
Transparency Making AI systems and their decision-making processes open, accessible, and understandable to stakeholders.
Types of Service Models IaaS (Infrastructure as a Service) IaaS (Infrastructure as a Service) provides virtualized computing resources like servers, storage, and networking on demand.
PaaS (Platform as a Service) PaaS (Platform as a Service) offers a platform with tools and services for developers to build, deploy, and manage applications without managing underlying infrastructure.
SaaS (Software as a Service) SaaS (Software as a Service) delivers software applications over the internet, accessible via a web browser without local installation.
FaaS (Function as a Service) FaaS (Function as a Service) allows developers to run individual functions or pieces of code in response to events, without managing servers or infrastructure.
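The FaaS model, a single function that runs when an event arrives, can be sketched as a Python handler in the style AWS Lambda uses. The `(event, context)` signature follows Lambda's Python convention; the `name` field in the event and the shape of the response are assumptions chosen for illustration.

```python
import json

def lambda_handler(event, context):
    """Run once per event; the platform supplies `event` and `context`."""
    # "name" is a hypothetical field; real event shapes depend on the trigger
    name = event.get("name", "world")
    return {
        "statusCode": 200,
        "body": json.dumps({"message": f"Hello, {name}!"}),
    }
```

Because the handler is an ordinary function, it can be called locally with a dictionary, for example `lambda_handler({"name": "Ada"}, None)`, before it is deployed.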
AWS Compute Services AWS EC2 AWS EC2 (Elastic Compute Cloud) provides scalable virtual servers to run applications in the cloud.
AWS Lambda AWS Lambda lets you run code in response to events without provisioning or managing servers.
AWS ECS AWS ECS (Elastic Container Service) is a fully managed container orchestration service to run and scale Docker containers.
AWS EKS AWS EKS (Elastic Kubernetes Service) is a managed Kubernetes service for running containerized applications using Kubernetes on AWS.
AWS Fargate AWS Fargate is a serverless compute engine that runs containers without requiring you to manage servers or clusters.
AWS Storage Services Amazon S3 Amazon S3 (Simple Storage Service) is an object storage service that offers scalable, secure, and durable storage for data of any type.
AWS Database Services AWS RDS AWS RDS (Relational Database Service) is a managed service for setting up, operating, and scaling relational databases like MySQL, PostgreSQL, and SQL Server.
AWS Redshift AWS Redshift is a fully managed, petabyte-scale cloud data warehouse service designed for fast and cost-effective analysis of large datasets using SQL.
AWS DynamoDB AWS DynamoDB is a fully managed NoSQL database service that provides fast and flexible performance at scale.
AWS Aurora AWS Aurora is a high-performance, fully managed relational database compatible with MySQL and PostgreSQL.
AWS Neptune AWS Neptune is a fully managed graph database service optimized for storing and querying highly connected data.
AWS Networking Services AWS VPC AWS VPC (Virtual Private Cloud) lets you provision a logically isolated network within AWS where you can launch resources in a defined IP range.
AWS Direct Connect AWS Direct Connect establishes a dedicated, high-speed, and low-latency network connection between your on-premises data center and AWS.
AWS VPN AWS VPN provides secure connections between your on-premises network or client devices and your AWS environment over the internet.
AWS Global Accelerator AWS Global Accelerator is a networking service that improves the availability and performance of your applications by directing user traffic through the AWS global network infrastructure.
AWS CloudFront AWS CloudFront is a content delivery network (CDN) that securely delivers data, videos, applications, and APIs with low latency.
AWS Monitoring Services AWS CloudWatch AWS CloudWatch is a monitoring service that collects and tracks metrics, logs, and events to provide real-time visibility into AWS resources and applications.
AWS CloudTrail AWS CloudTrail is a service that records AWS API calls and user activity to enable governance, compliance, and operational auditing.
AWS X-Ray A distributed tracing service that helps debug and analyze performance issues in microservices applications.
AWS Security Services AWS Cognito AWS Cognito enables secure user authentication, authorization, and user management for web and mobile apps.
AWS Artifact AWS Artifact is a service that provides on-demand access to AWS compliance reports, security and compliance documentation, and agreements.
AWS IAM AWS IAM (Identity and Access Management) allows you to securely manage access to AWS services and resources for users and roles.
AWS Config AWS Config continuously monitors and records your AWS resource configurations to assess compliance and detect changes.
Amazon Inspector Amazon Inspector automatically assesses applications for vulnerabilities or deviations from best practices.
AWS KMS AWS KMS (Key Management Service) enables you to create and manage cryptographic keys and control their use across AWS services.
AWS Macie AWS Macie uses machine learning to discover, classify, and protect sensitive data such as personally identifiable information (PII) in AWS.
AWS Machine Learning Services AWS SageMaker Amazon SageMaker is a fully managed service that enables you to build, train, and deploy machine learning models at scale.
AWS Lex AWS Lex is a service for building conversational interfaces using voice and text, powered by the same technology as Amazon Alexa.
AWS Polly AWS Polly converts text into lifelike speech, enabling you to create applications that talk.
AWS Chatbot AWS Chatbot is an interactive agent that enables DevOps and monitoring notifications in Slack and Amazon Chime.
AWS Rekognition AWS Rekognition is a deep learning-based image and video analysis service that can identify objects, people, text, scenes, and activities, as well as detect inappropriate content.
AWS Comprehend AWS Comprehend is a natural language processing (NLP) service that uses machine learning to uncover insights and relationships in text.
Amazon Q An AI-powered assistant for AWS that helps developers and IT professionals answer questions, write code, and troubleshoot faster.
Amazon Bedrock A fully managed service that allows you to build and scale generative AI applications using foundation models via API without managing infrastructure.
Amazon Translate A neural machine translation service that delivers fast, high-quality language translation.
Amazon Transcribe An automatic speech recognition (ASR) service that converts speech to text quickly and accurately.
PartyRock A no-code generative AI app builder that lets you create and share AI-powered apps easily using Amazon Bedrock.
Amazon Mechanical Turk A crowdsourcing marketplace that enables businesses to outsource tasks to a global workforce.
Amazon Augmented AI (A2I) A service that enables human review of machine learning predictions to improve accuracy and compliance.
AWS Kendra An intelligent search service powered by machine learning that enables organizations to search unstructured data using natural language.
AWS Textract Amazon Textract is a machine learning service that automatically extracts text, handwriting, and data from scanned documents and forms.
SageMaker Specialization Services SageMaker Ground Truth A data labeling service to build highly accurate training datasets for machine learning quickly.
SageMaker Canvas Enables business analysts to build machine learning models and generate accurate predictions without writing code.
SageMaker Experiments A tool to organize, track, compare, and evaluate machine learning experiments.
SageMaker Model Monitor Monitors deployed models in production for data and model quality issues and automatically detects and alerts on potential problems.
SageMaker Pipelines A service to build, automate, and manage end-to-end machine learning workflows.
SageMaker Model Registry A centralized repository to store, version, and manage machine learning models.
SageMaker Feature Store A purpose-built repository for storing, retrieving, and sharing machine learning features.
SageMaker Inference Recommender Helps select the best compute instance and configuration for inference workloads by running benchmark tests on different configurations.
SageMaker Serverless Inference Allows serving machine learning models without managing infrastructure, automatically scaling based on traffic patterns.
SageMaker Real time Inference Provides persistent endpoints for real-time inference that are fully managed and can automatically scale.
SageMaker Batch Transform A service for offline inference that processes large datasets in batches.
SageMaker Asynchronous Inference Supports workloads that involve large payloads or have long inference processing times, decoupling request and response so clients don't have to wait for the inference response.
AWS Analytics and Streaming Services AWS Kinesis AWS Kinesis is a fully managed service for collecting, processing, and analyzing real-time streaming data at scale.
AWS Glue AWS Glue is a fully managed extract, transform, and load (ETL) service that helps prepare and move data for analytics and machine learning.
Amazon Athena A serverless query service that lets you analyze data directly in Amazon S3 using standard SQL.
Amazon EMR A managed cluster platform that simplifies running big data frameworks like Apache Spark and Hadoop.
Amazon QuickSight A scalable business intelligence service for creating interactive dashboards and visualizations.
AWS Lake Formation A service that simplifies the process of building, securing, and managing data lakes on AWS.
AWS Data Migration Services AWS Snowball AWS Snowball is a physical data transfer device that helps move large amounts of data into or out of AWS securely and efficiently.
AWS Database Migration Service (DMS) Helps migrate databases to AWS quickly and securely with minimal downtime.
AWS DataSync Automates and accelerates moving large amounts of data between on-premises storage and AWS services.
AWS Transfer Family Enables secure file transfers directly into and out of Amazon S3 using SFTP, FTPS, and FTP.
AWS Notification Services AWS SNS AWS SNS (Simple Notification Service) is a fully managed messaging service for sending notifications or messages to distributed systems, applications, or users.
AWS SQS AWS SQS (Simple Queue Service) is a fully managed message queuing service that enables decoupling and scaling of microservices, distributed systems, and serverless applications.
AWS Billing and Cost Management AWS Budgets AWS Budgets enables you to set custom cost and usage budgets and receive alerts when thresholds are exceeded.
AWS Organizations AWS Organizations lets you centrally manage and govern multiple AWS accounts for consolidated billing and policy control.
AWS Support AWS Support provides technical assistance and guidance through various support plans, including 24/7 access to AWS experts.
AWS Pricing Calculator AWS Pricing Calculator helps you estimate the cost of AWS services based on your specific configuration and usage needs.