Skip to content

AI & Machine Learning Terms

Essential terminology for Artificial Intelligence, Machine Learning, Deep Learning, and related technologies.

A metric measuring the proportion of correct predictions made by a model out of all predictions.

An entity that perceives its environment through sensors and acts upon that environment through actuators.

A step-by-step procedure or set of rules for solving a problem or performing a computation in AI/ML.

The simulation of human intelligence in machines programmed to think, learn, and problem-solve.

A computing system inspired by biological neural networks, consisting of interconnected nodes (neurons) that process information.

An algorithm for training neural networks by calculating gradients and adjusting weights to minimize error.

In ML, a systematic error introduced by the model’s assumptions; also refers to a parameter in neural networks that helps fit data better.

Extremely large datasets that require specialized tools and techniques for storage, processing, and analysis.

A supervised learning task where the model predicts discrete class labels for input data.

An unsupervised learning technique that groups similar data points together based on their characteristics.

A field of AI that enables computers to interpret and understand visual information from the world.

A deep learning architecture specialized for processing grid-like data such as images.

Techniques for artificially increasing the size and diversity of training data by applying transformations.

A subset of machine learning using neural networks with multiple layers to learn hierarchical representations of data.

Techniques for reducing the number of features in a dataset while preserving important information.

A representation of data (text, images) in a lower-dimensional continuous vector space, capturing semantic relationships.

One complete pass through the entire training dataset during model training.

Quantitative measures used to assess model performance (accuracy, precision, recall, F1-score, etc.).

An individual measurable property or characteristic of a phenomenon being observed, used as input for ML models.

The process of creating new features or transforming existing ones to improve model performance.

The process of adapting a pre-trained model to a specific task by continuing training on task-specific data.

AI systems that can create new content (text, images, music, code) based on training data.

An optimization algorithm that iteratively adjusts model parameters to minimize the loss function.

When an AI model generates plausible-sounding but incorrect or nonsensical information.

Configuration settings for ML algorithms that are set before training begins (learning rate, batch size, number of layers).

The process of using a trained model to make predictions on new, unseen data.

The degree to which humans can understand and explain the decisions made by an ML model.

The output or target variable in supervised learning that the model is trained to predict.

A type of AI model trained on vast amounts of text data to understand and generate human-like text.

A hyperparameter that controls how much model weights are adjusted during training.

A mathematical function that measures how well a model’s predictions match the actual values.

A subset of AI that enables systems to learn and improve from experience without being explicitly programmed.

A mathematical representation trained on data to make predictions or decisions.

The process of learning patterns from data by adjusting model parameters to minimize error.

A field of AI focused on enabling computers to understand, interpret, and generate human language.

A computational model inspired by the human brain’s structure and function, consisting of interconnected nodes.

When a model learns training data too well, including noise, resulting in poor performance on new data.

The process of finding the best parameters for a model to minimize error or maximize performance.

The simplest form of a neural network, consisting of a single neuron that makes binary decisions.

A metric measuring the proportion of true positive predictions out of all positive predictions made.

A model that has already been trained on a large dataset and can be fine-tuned or used for transfer learning.

Input text or instructions given to a language model to generate a specific response or perform a task.

The practice of crafting effective prompts to get desired outputs from AI models, especially LLMs.

A metric measuring the proportion of true positive predictions out of all actual positive instances.

A neural network architecture designed for sequential data, with connections that form cycles.

A supervised learning task where the model predicts continuous numerical values.

A learning paradigm where an agent learns to make decisions by receiving rewards or penalties for actions.

A technique combining information retrieval with generative models to produce more accurate and grounded responses.

A learning approach where models are trained on labeled data with known input-output pairs.

A supervised learning algorithm used for classification and regression tasks.

The dataset used to train a machine learning model, consisting of input features and corresponding labels.

A technique where a model trained on one task is adapted for a different but related task.

A deep learning architecture based on attention mechanisms, fundamental to modern NLP models like GPT and BERT.

When a model is too simple to capture the underlying patterns in data, resulting in poor performance.

A learning approach where models find patterns in data without labeled outputs.

A subset of data used to evaluate model performance during training and tune hyperparameters.

The amount by which model predictions vary when trained on different datasets.

A parameter in neural networks that represents the strength of connection between neurons, adjusted during training.


These AI and ML terms provide a foundation for understanding modern artificial intelligence and machine learning technologies.