AI Practitioner Definitions and Services Cheat Sheet
Category | Name | Definition |
---|---|---|
Benefits of Cloud Computing | Elasticity | Elasticity refers to the ability to automatically scale resources up or down in response to changing demand, ensuring optimal performance and cost-efficiency. |
Global Reach | Global reach is the ability to deploy applications and services in data centers around the world, providing low-latency access and consistent user experiences regardless of geographic location. | |
Pay-As-You-Go Pricing | Pay-as-you-go pricing means you pay only for the resources and services you use, without upfront costs or long-term commitments. | |
Agility | Agility refers to the ability to rapidly develop, test, and deploy applications, enabling faster innovation and adaptation to changing business needs. | |
Fault Tolerance | Fault tolerance is the capability of a system to continue operating properly even when one or more components fail. | |
Scalability | Scalability refers to the ability to handle increasing or decreasing workloads by dynamically adjusting resources without affecting performance. | |
Types of Cloud Computing | Public Cloud | Public Cloud is a cloud computing model where services are delivered over the internet and shared across multiple organizations. |
Private Cloud | Private Cloud is a cloud environment dedicated to a single organization, offering greater control and security. | |
Hybrid Cloud | Hybrid Cloud combines public and private clouds to enable data and application sharing between them for greater flexibility. | |
Multi Cloud | Multi-Cloud involves using services from multiple cloud providers to avoid vendor lock-in and optimize performance or cost. | |
Machine Learning Methods | Classification | A supervised learning task that assigns inputs into predefined categories or classes. |
Regression | A supervised learning task that predicts continuous numerical values based on input data. | |
Clustering | An unsupervised learning technique that groups similar data points together based on their features. | |
Overfitting | A modeling error where a machine learning model learns the training data too well, including noise, resulting in poor generalization. | |
Underfitting | A scenario where a model is too simple to capture the underlying patterns in the data, leading to poor performance on both training and test sets. | |
Generative AI Evaluation Metrics | ROGUE | A metric that evaluates the quality of generated summaries by measuring n-gram overlap with reference summaries. |
BLEU | A metric for assessing machine translation quality based on the overlap of n-grams between the machine output and human reference translations | |
GLUE | A benchmark suite that evaluates NLP models on a diverse set of natural language understanding tasks. | |
MMLU | A benchmark that measures a model’s performance across a wide range of academic and professional subjects to assess knowledge and reasoning capabilities. | |
Machine Learning Concepts | Supervised Learning | A machine learning approach where the model is trained on labeled data to predict outcomes. |
Unsupervised Learning | A machine learning method that finds patterns or structures in data without labeled outcomes | |
Reinforcement Learning | A learning paradigm where an agent learns to make decisions by interacting with an environment and receiving feedback through rewards or penalties. | |
Transfer Learning | A technique where a model trained on one task is adapted for a related but different task, leveraging prior knowledge. | |
Retrieval Augemented Generation (RAG) | A method that combines information retrieval with text generation, enabling models to generate responses using both pre-trained knowledge and external documents. | |
Prompt Techniques | Few Shot Prompting | Providing a few examples to help the LLM understand and generate responses better. |
Zero Shot Prompting | Providing a prompt with no examples. The model generates responses based on its training. |
|
Prompt Templates | Using predefined structures that include instructions, examples, and specific content. | |
Chain of thought Prompting | Breaking down complex tasks into intermediate steps to improve coherence. | |
Prompt Tuning | Replacing the prompt text with continuous embeddings optimized during training. | |
Inference Parameters | Temperature | A parameter that controls the randomness of a model’s output, higher values produce more diverse responses, while lower values make outputs more focused and deterministic |
Top K | A decoding technique where the model considers only the top K most probable tokens at each step, increasing coherence by limiting choice to the most likely options | |
Top P | A decoding method where the model selects from the smallest set of tokens whose cumulative probability exceeds P, balancing diversity and relevance dynamically. | |
Response Length | The maximum number of tokens the model is allowed to generate in its output | |
Stop Sequences | User-defined token sequences that signal the model to stop generating further text when encountered. | |
Penalties | Controls that reduce the likelihood of repeating words or phrases by penalizing tokens already used in the generation | |
Responsible AI | Fairness | Ensuring AI systems do not produce biased outcomes and treat all individuals or groups equitably. |
Explainability | The ability to understand and articulate how an AI model makes its decisions or predictions. | |
Robustness | The resilience of an AI system to adversarial inputs, errors, or changes in data without degrading performance. | |
Privacy | Protecting user data and ensuring AI systems do not expose or misuse sensitive personal information. | |
Governance | The policies, processes, and oversight mechanisms put in place to ensure ethical and compliant AI development and use. | |
Transparency | Making AI systems and their decision-making processes open, accessible, and understandable to stakeholders. | |
Types of Service Models | IaaS (Infrastructure as a Service | IaaS (Infrastructure as a Service) provides virtualized computing resources like servers, storage, and networking on demand. |
PaaS (Platform as a Service) | PaaS (Platform as a Service) offers a platform with tools and services for developers to build, deploy, and manage applications without managing underlying infrastructure. | |
SaaS (Software as a Service) | SaaS (Software as a Service) delivers software applications over the internet, accessible via a web browser without local installation. | |
FaaS (Function as a Service) | FaaS (Function as a Service) allows developers to run individual functions or pieces of code in response to events, without managing servers or infrastructure. | |
AWS Compute Services | AWS EC2 | AWS EC2 (Elastic Compute Cloud) provides scalable virtual servers to run applications in the cloud. |
AWS Lambda | AWS Lambda lets you run code in response to events without provisioning or managing servers. | |
AWS ECS | AWS ECS (Elastic Container Service) is a fully managed container orchestration service to run and scale Docker containers. | |
AWS EKS | AWS EKS (Elastic Kubernetes Service) is a managed Kubernetes service for running containerized applications using Kubernetes on AWS. | |
Amazon Fargate | Amazon Fargate is a serverless compute engine that runs containers without requiring you to manage servers or clusters. | |
AWS Storage Services | Amazon S3 | Amazon S3 (Simple Storage Service) is an object storage service that offers scalable, secure, and durable storage for data of any type. |
AWS Database Services | AWS RDS | AWS RDS (Relational Database Service) is a managed service for setting up, operating, and scaling relational databases like MySQL, PostgreSQL, and SQL Server. |
AWS Redshift | AWS Redshift is a fully managed, petabyte-scale cloud data warehouse service designed for fast and cost-effective analysis of large datasets using SQL. | |
AWS DynamoDB | AWS DynamoDB is a fully managed NoSQL database service that provides fast and flexible performance at scale. | |
AWS Aurora | AWS Aurora is a high-performance, fully managed relational database compatible with MySQL and PostgreSQL. | |
AWS Neptune | AWS Neptune is a fully managed graph database service optimized for storing and querying highly connected data. | |
AWS Networking Services | AWS VPC | AWS VPC (Virtual Private Cloud) lets you provision a logically isolated network within AWS where you can launch resources in a defined IP range. |
AWS Direct Connect | AWS Direct Connect establishes a dedicated, high-speed, and low-latency network connection between your on-premises data center and AWS. | |
AWS VPN | AWS VPN provides secure connections between your on-premises network or client devices and your AWS environment over the internet. | |
AWS Global Accelerator | AWS Global Accelerator is a networking service that improves the availability and performance of your applications by directing user traffic through the AWS global network infrastructure. | |
AWS CloudFront | AWS CloudFront is a content delivery network (CDN) that securely delivers data, videos, applications, and APIs with low latency. | |
AWS Monitoring Services | AWS CloudWatch | AWS CloudWatch is a monitoring service that collects and tracks metrics, logs, and events to provide real-time visibility into AWS resources and applications. |
AWS CloudTrail | AWS CloudTrail is a service that records AWS API calls and user activity to enable governance, compliance, and operational auditing. | |
AWS X-Ray | A distributed tracing service that helps debug and analyze performance issues in microservices applications. | |
AWS Security Services | AWS Cognito | AWS Cognito enables secure user authentication, authorization, and user management for web and mobile apps. |
AWS Artifact | AWS Artifact is a service that provides on-demand access to AWS compliance reports, security and compliance documentation, and agreements. | |
AWS IAM | AWS IAM (Identity and Access Management) allows you to securely manage access to AWS services and resources for users and roles. | |
AWS Config | AWS Config continuously monitors and records your AWS resource configurations to assess compliance and detect changes. | |
Amazon Inspector | Amazon Inspector automatically assesses applications for vulnerabilities or deviations from best practices. | |
AWS KMS | AWS KMS (Key Management Service) enables you to create and manage cryptographic keys and control their use across AWS services. | |
AWS Macie | AWS Macie uses machine learning to discover, classify, and protect sensitive data such as personally identifiable information (PII) in AWS. | |
AWS Machine Learning Services | AWS SageMaker | Amazon SageMaker is a fully managed service that enables you to build, train, and deploy machine learning models at scale. |
AWS Lex | AWS Lex is a service for building conversational interfaces using voice and text, powered by the same technology as Amazon Alexa. | |
AWS Polly | AWS Polly converts text into lifelike speech, enabling you to create applications that talk. | |
AWS Chatbot | AWS Chatbot is an interactive agent that enables DevOps and monitoring notifications in Slack and Amazon Chime. | |
AWS Rekognition | AWS Rekognition is a deep learning-based image and video analysis service that can identify objects, people, text, scenes, and activities, as well as detect inappropriate content. | |
AWS Comprehend | AWS Comprehend is a natural language processing (NLP) service that uses machine learning to uncover insights and relationships in text. | |
Amazon Q | An AI-powered assistant for AWS that helps developers and IT professionals answer questions, write code, and troubleshoot faster. | |
Amazon Bedrock | A fully managed service that allows you to build and scale generative AI applications using foundation models via API without managing infrastructure. | |
Amazon Translate | A neural machine translation service that delivers fast, high-quality language translation. | |
Amazon Transcribe | An automatic speech recognition (ASR) service that converts speech to text quickly and accurately. | |
AWS Partyrock | A no-code generative AI app builder that lets you create and share AI-powered apps easily using Amazon Bedrock. | |
Amazon Mechnical Turk | A crowdsourcing marketplace that enables businesses to outsource tasks to a global workforce. | |
Amazon Augmented Intelligence(A2I) | A service that enables human review of machine learning predictions to improve accuracy and compliance. | |
AWS Kendra | An intelligent search service powered by machine learning that enables organizations to search unstructured data using natural language. | |
AWS Textract | Amazon Textract is a machine learning service that automatically extracts text, handwriting, and data from scanned documents and forms. | |
SageMaker Specialization Services | SageMaker Ground Truth | A data labeling service to build highly accurate training datasets for machine learning quickly. |
SageMaker Canvas | Enables business analysts to build machine learning models and generate accurate predictions without writing code. | |
SageMaker Experiments | A tool to organize, track, compare, and evaluate machine learning experiments. | |
SageMaker Model Monitor | Monitors deployed models in production for data and model quality issues and automatically detects and alerts on potential problems. | |
SageMaker Pipelines | A service to build, automate, and manage endto-end machine learning workflows. | |
SageMaker Model Registery | A centralized repository to store, version, and manage machine learning models. | |
SageMaker Feature Store | A purpose-built repository for storing, retrieving, and sharing machine learning features. | |
SageMaker Inference Recommender | Helps select the best compute instance and configuration for inference workloads by running benchmark tests on different configurations. | |
SageMaker Serverless Inference | Allows serving machine learning models without managing infrastructure, automatically scaling based on traffic patterns. | |
SageMaker Real time Inference | Provides persistent endpoints for real-time inference that are fully managed and can automatically scale. | |
SageMaker Batch Transform | A service for offline inference that processes large datasets in batches. | |
SageMaker Asynchronous Interface | Supports workloads that involve large payloads or have long inference processing times, decoupling request and response so clients don't have to wait for the inference response. | |
AWS Analytics and Streaming Services | AWS Kinesis | AWS Kinesis is a fully managed service for collecting, processing, and analyzing real-time streaming data at scale. |
AWS Glue | AWS Glue is a fully managed extract, transform, and load (ETL) service that helps prepare and move data for analytics and machine learning. | |
Amazon Athena | A serverless query service that lets you analyze data directly in Amazon S3 using standard SQL. | |
Amazon EMR | A managed cluster platform that simplifies running big data frameworks like Apache Spark and Hadoop. | |
Amazon QuickSight | A scalable business intelligence service for creating interactive dashboards and visualizations. | |
AWS Lake Formation | A service that simplifies the process of building, securing, and managing data lakes on AWS. | |
AWS Data Migration Services | AWS Snowball | AWS Snowball is a physical data transfer device that helps move large amounts of data into or out of AWS securely and efficiently. |
AWS Data Migration Service (DMS) | Helps migrate databases to AWS quickly and securely with minimal downtime. | |
AWS DataSync | Automates and accelerates moving large amounts of data between on-premises storage and AWS services. | |
AWS Transfer Family | Enables secure file transfers directly into and out of Amazon S3 using SFTP, FTPS, and FTP. | |
AWS Notification Services | AWS SNS | AWS SNS (Simple Notification Service) is a fully managed messaging service for sending notifications or messages to distributed systems, applications, or users. |
AWS SQS | AWS SQS (Simple Queue Service) is a fully managed message queuing service that enables decoupling and scaling of microservices, distributed systems, and serverless applications. | |
AWS Billing and Cost Management | AWS Budgets | AWS Budgets enables you to set custom cost and usage budgets and receive alerts when thresholds are exceeded. |
AWS Organizations | AWS Organizations lets you centrally manage and govern multiple AWS accounts for consolidated billing and policy control. | |
AWS Support | AWS Support provides technical assistance and guidance through various support plans, including 24/7 access to AWS experts. | |
AWS Pricing Calculator | AWS Pricing Calculator helps you estimate the cost of AWS services based on your specific configuration and usage needs. |