Book, AI Terminology

August 27, 2018

AI Industry Insights

AI Terminology For Business Leaders [Beta]

Artificial Intelligence is a complicated, rapidly evolving field. Real competency is mostly found in “AI First” companies like Google, Amazon, Netflix, Baidu, and Facebook. However, everyone else should not ignore artificial intelligence. If you are unsure whether AI will matter, check out the patent and investment trends over the last decade. AI is, however, a very noisy topic. One of the biggest barriers we see while advocating for AI adoption is communication. Many strategy leaders are intimidated by AI and are either unaware of or unsure about the terminology. If we want to democratize AI, we need to be able to communicate and share ideas. That requires a shared vocabulary. It requires a dictionary, if you will.

This is a dictionary for business/strategy leaders. Why? Because artificial intelligence requires support from the C-suite to work. Otherwise, projects and data will remain in silos with limited measurable impact. Compared to other glossaries out there, this dictionary will be different in two ways:

This dictionary will be collaborative and require input from other technical and strategy practitioners. We encourage you to give input on key terms and techniques every strategy leader should know for adopting AI into their organization. With input from techies and non-techies alike, together we can make this an expansive resource.
This AI dictionary will include resources to dive deeper into terms and techniques for further exploration, key issues, and training.

We need your help! We need help building out our list of key terminology you think every business/strategy leader needs to know. We also need to add great resources for readers to explore further. If you have something to share, please share! You can email your additions to (steve@kungfu.ai) or add them to the comment section as we spread this around. Let’s see where this goes.

A

Artificial Intelligence

Intelligent machines that perceive the world around them, form plans, and make decisions to achieve their goals. Its foundations include mathematics, logic, philosophy, probability, linguistics, neuroscience, and decision theory.

Resources:

Technical and business-level introductions into AI: https://www.kungfu.ai/0-60-in-ai-for-free-how-to-get-smart-in-ai-fast/

Artificial Narrow Intelligence (ANI)

Artificial intelligence that can effectively perform a narrowly defined task. Examples appear in fields such as computer vision, robotics, machine learning, and natural language processing.

Artificial General Intelligence (AGI)

Also known as strong AI, AGI is an artificial intelligence that can successfully perform any intellectual task that a human being can, including learning, planning and decision-making under uncertainty, communicating in natural language, making jokes, manipulating people, trading stocks, or... reprogramming itself.

Artificial Superintelligence (ASI)

An ultra-intelligent machine that can surpass all the intellectual activities of any person, however clever.

B

Bias

The amount of error introduced by approximating real-world phenomena with a simplified model. Separately, datasets used in AI are often biased with respect to race, gender, and ethnicity, which can put projects at risk.

Resources:

The risk of bias in AI: http://fortune.com/longform/ai-bias-problem/

C

Categories

Data points, often words or titles, used as inputs in machine learning to train or test an algorithm.

Classification

A model that outputs the probability of a categorical target variable Y belonging to a certain class. For example, is this a picture of a cat or a dog?
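
To make "outputs the probability" concrete, here is a minimal sketch of a logistic-style classifier for the cat-or-dog question. The two input features and the weights are hypothetical values chosen for illustration; a real model would learn its weights from labeled training data.

```python
import math

def cat_probability(ear_pointiness, snout_length, weights=(2.0, -1.5), bias=0.0):
    """Return the probability (0 to 1) that a picture shows a cat.

    The features and weights are hypothetical; a real classifier
    would learn the weights from labeled examples.
    """
    # Weighted sum of the input features.
    score = weights[0] * ear_pointiness + weights[1] * snout_length + bias
    # The logistic (sigmoid) function squashes any score into the range 0 to 1.
    return 1.0 / (1.0 + math.exp(-score))

p = cat_probability(ear_pointiness=0.9, snout_length=0.2)
print(round(p, 2))  # 0.82
print("cat" if p >= 0.5 else "dog")  # cat
```

The model returns a probability, and a separate threshold (0.5 here) turns that probability into a decision.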

Clustering

The goal of clustering is to create groups of data points such that points in different clusters are dissimilar while points within a cluster are similar. Clustering is often considered the most important unsupervised learning problem. Like every other problem of this kind, it deals with finding structure in a collection of unlabeled data, that is, finding relationships between data points in a multi-dimensional space.
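
As an illustration, the sketch below uses k-means, one common clustering method, to group a handful of made-up customers. The customer data, the choice of two clusters, and the use of the first k points as starting centers are all assumptions made to keep the example short.

```python
import math

def kmeans(points, k, iterations=10):
    """Group points into k clusters with plain k-means (illustrative only)."""
    # Assumption: start from the first k points as cluster centers.
    centers = [points[i] for i in range(k)]
    assignments = [0] * len(points)
    for _ in range(iterations):
        # Step 1: assign each point to its nearest center.
        assignments = [
            min(range(k), key=lambda c: math.dist(p, centers[c]))
            for p in points
        ]
        # Step 2: move each center to the mean of the points assigned to it.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:
                centers[c] = tuple(sum(v) / len(members) for v in zip(*members))
    return assignments, centers

# Hypothetical customers: (visits per month, average spend in dollars).
customers = [(1, 10), (2, 12), (1, 11), (9, 95), (10, 100), (8, 90)]
cluster_ids, centers = kmeans(customers, k=2)
print(cluster_ids)  # [0, 0, 0, 1, 1, 1]
```

No labels were supplied: the algorithm found the two groups (occasional low spenders and frequent high spenders) from the data alone.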

Convolutional Neural Networks (CNNs)

CNNs are designed specifically for taking images as input, and are effective for computer vision tasks. They are also instrumental in deep reinforcement learning. CNNs are specifically inspired by the way animal visual cortices work.

Resource:

Understanding CNNs: https://towardsdatascience.com/intuitively-understanding-convolutions-for-deep-learning-1f6f42faee1

D

Deep learning

A subset of machine learning that has networks capable of learning from data that is unstructured or unlabeled. Deep learning (DL) is distinct in that its networks are composed of multiple layers, typically between 10 and 100 (hence ‘deep’), in contrast to other machine learning algorithms, which tend to have only one or two. Each layer of the network detects characteristics of its inputs, and the computations at each level build upon those of previous levels, which allows the network to “learn” more nuanced and abstract characteristics to determine the output.

Resources:

Business impact of Deep Learning: https://www.kaleidoinsights.com/impact-analysis-business-impacts-of-deep-learning/
Difference between Machine Learning and Deep Learning (NN): https://www.zendesk.com/blog/machine-learning-and-deep-learning/

Dimensionality Reduction

Dimensionality reduction looks a lot like compression. This is about trying to reduce the complexity of the data while keeping as much of the relevant structure as possible.

E

Explainability

Also called Transparent AI, explainable AI is an artificial intelligence (AI) whose actions can be easily understood by humans. It contrasts with the concept of the "black box" in machine learning, where the workings of complex algorithms are so hard to interpret that even their designers cannot explain why the AI arrived at a specific decision.

Resource:

More about explainability: https://en.wikipedia.org/wiki/Explainable_Artificial_Intelligence

F

Features

Data points, often numerical values, used as inputs in machine learning to train or test an algorithm.

Feature Engineering

Feature engineering is the process of transforming raw data into features that better represent the underlying problem to the predictive models, resulting in improved model accuracy on unseen data. If feature engineering is done correctly, it increases the predictive power of machine learning algorithms by creating features from raw data that help facilitate the machine learning process.

Resource:

More on the importance of feature engineering: https://medium.com/mindorks/what-is-feature-engineering-for-machine-learning-d8ba3158d97a

Feature Extraction

The automatic derivation of useful features from raw data. Deep-learning models are capable of learning to focus on the right features by themselves, requiring little guidance from the programmer.

Feature Representation

A feature is an individual measurable property or characteristic (within data) of a phenomenon being observed.

Feature Scaling

A method used to standardize the range of independent variables or features of data. In data processing, it is also known as data normalization and is generally performed during the data preprocessing step.
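
One simple form of feature scaling is min-max scaling, sketched below. The ages and incomes are made-up values; the point is that two features with very different ranges end up on the same 0-to-1 scale.

```python
def min_max_scale(values):
    """Rescale a list of numbers to the range 0 to 1."""
    low, high = min(values), max(values)
    if high == low:
        # All values are identical, so there is no range to scale over.
        return [0.0 for _ in values]
    return [(v - low) / (high - low) for v in values]

# Hypothetical features measured on very different scales.
ages = [20, 30, 40, 60]
incomes = [30_000, 50_000, 90_000, 130_000]

print(min_max_scale(ages))     # [0.0, 0.25, 0.5, 1.0]
print(min_max_scale(incomes))  # [0.0, 0.2, 0.6, 1.0]
```

Without scaling, a distance-based method would let income dominate age simply because its numbers are larger.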

G

Ground Truth Data

In machine learning, the term "ground truth" refers to the accuracy of the training set's classification for supervised learning techniques. This is used in statistical models to prove or disprove research hypotheses. The term "ground truthing" refers to the process of gathering the proper objective (provable) data for this test. Compare with gold standard.

H

Hyperparameter

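A general setting of your model that can be increased or decreased (i.e., tuned) in order to improve performance. Hyperparameters are chosen before training rather than learned from the data. In equations, a hyperparameter is often represented by a lambda.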

K

k-Nearest Neighbors (k-NN)

k-NN labels a test data point x by finding the mean (for numerical labels) or mode (for categorical labels) of the k closest data points’ labels. You can measure the similarity of data points by creating a vector representation of the items and then comparing the vectors using an appropriate distance metric (like the Euclidean distance, for example). k-NN is commonly used in concept search and product recommendations.
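
The description above can be sketched in a few lines. This version handles categorical labels by taking the mode of the k nearest neighbors and uses Euclidean distance; the product data and category names are hypothetical.

```python
import math
from collections import Counter

def knn_predict(train_points, train_labels, query, k=3):
    """Label `query` with the most common label among its k nearest neighbors."""
    # Rank training points by Euclidean distance to the query point.
    order = sorted(
        range(len(train_points)),
        key=lambda i: math.dist(train_points[i], query),
    )
    nearest_labels = [train_labels[i] for i in order[:k]]
    # The mode (most frequent label) of the k neighbors is the prediction.
    return Counter(nearest_labels).most_common(1)[0][0]

# Hypothetical products described by (price in dollars, weight in kg).
products = [(10, 1.0), (12, 1.2), (11, 0.9), (200, 15.0), (220, 14.0), (210, 16.0)]
categories = ["accessory", "accessory", "accessory",
              "appliance", "appliance", "appliance"]

print(knn_predict(products, categories, (15, 1.1)))  # accessory
```

In practice the features would usually be scaled first so that no single feature dominates the distance calculation.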

M

Machine learning

A subfield of artificial intelligence made up of algorithms that allow computers to learn on their own. Machine learning enables computers to identify patterns in observed data, build models that explain the world, and predict things without having explicit pre-programmed rules and models.

Resource:

Difference between Machine Learning and Deep Learning (NN): https://www.zendesk.com/blog/machine-learning-and-deep-learning/

Model

Once a machine learning algorithm has been trained on data, the output of the process is a model, which can be used to make predictions.

N

Neural Network

A computing system made up of a number of simple, highly interconnected processing elements, which process information by their dynamic state response to external inputs (see Deep Learning). A neural network (NN) model is designed to continually analyze data with a logic structure similar to how a human would draw conclusions. The design of a NN is inspired by the biological neural network of the human brain. On many tasks, this can make for machine intelligence that is far more capable than that of standard machine learning models.

Resources:

About Neural Networks: http://pages.cs.wisc.edu/~bolo/shipyard/neural/local.html
Difference between Machine Learning and Deep Learning (NN): https://www.zendesk.com/blog/machine-learning-and-deep-learning/

O

Overfitting

Learning a function that perfectly explains the training data that the model learned from, but doesn’t generalize well to unseen test data. Overfitting happens when a model learns too much from the training data, to the point that it starts picking up idiosyncrasies that aren’t representative of patterns in the real world.

Resource:

Conceptual explanation of overfit and underfit: https://towardsdatascience.com/overfitting-vs-underfitting-a-conceptual-explanation-d94ee20ca7f9

R

Random Forests

A random forest is a supervised learning algorithm built from many decision trees. A decision tree is a decision support tool that uses a tree-like graph to show possible consequences. If you input a training dataset with targets and features into a decision tree, it will formulate a set of rules, and these rules can be used to make predictions. A random forest trains many such trees on random subsets of the data and combines their predictions.

Resource:

Deeper dive on Random Forest: https://medium.com/@Synced/how-random-forest-algorithm-works-in-machine-learning-3c0fe15b6674

Regression

A model that predicts a continuous numerical value, such as a price or a sales figure.
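
As a small illustration, the sketch below fits a straight line to made-up advertising and sales figures using ordinary least squares, then uses the line to predict a value it has not seen. The data and variable names are assumptions for the example.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope: how x and y vary together, divided by how much x varies.
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance_x = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance_x
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data: ad spend and resulting sales, both in thousands of dollars.
ad_spend = [1, 2, 3, 4]
sales = [3, 5, 7, 9]

slope, intercept = fit_line(ad_spend, sales)
predicted_sales = slope * 5 + intercept
print(slope, intercept, predicted_sales)  # 2.0 1.0 11.0
```

Unlike classification, the output here is a number on a continuous scale rather than a category.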

Recurrent neural networks (RNNs)

RNNs have a sense of built-in memory and are well-suited for language problems. They’re also important in reinforcement learning since they enable the agent to keep track of where things are and what happened historically even when those elements aren’t all visible at once.

Reinforcement Learning (RL)

A type of machine learning in which an agent learns goal-oriented behavior by trial and error in an environment that rewards or penalizes the agent's actions toward achieving that goal.

S

Supervised Learning

An algorithm that identifies patterns in data (inputs) to form heuristics and then automate or predict a certain output. It requires example inputs paired with their expected outputs (labels) in order to work.

Structured Data

Refers to information with a high degree of organization, such that inclusion in a relational database is seamless and the data is readily searchable by simple, straightforward search engine algorithms or other search operations. Unstructured data is essentially the opposite: a lack of structure makes compilation a time- and energy-consuming task. It would be beneficial to a company across all business strata to find a mechanism of data analysis to reduce the costs unstructured data adds to the organization.

Resource:

Structured versus unstructured data: https://brightplanet.com/2012/06/structured-vs-unstructured-data/

T

Training Data

A subset of data that represents the features of the desired output and is used to train a model to make predictions or perform a desired behavior.

Test Data

A subset of data that represents the features of the desired output and is used to test a model for fidelity after training has occurred.
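
A common way to obtain training and test data is to shuffle one dataset and hold part of it back. The sketch below shows the idea; the 25% test fraction and the fixed seed are arbitrary choices for illustration.

```python
import random

def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle the rows and hold back a fraction of them for testing."""
    shuffled = rows[:]  # Copy so the caller's list is left untouched.
    random.Random(seed).shuffle(shuffled)  # A fixed seed makes the split repeatable.
    n_test = int(len(shuffled) * test_fraction)
    test_rows = shuffled[:n_test]
    train_rows = shuffled[n_test:]
    return train_rows, test_rows

# Hypothetical dataset of 20 records, represented here by their ID numbers.
records = list(range(20))
train_rows, test_rows = train_test_split(records)
print(len(train_rows), len(test_rows))  # 15 5
```

The key property is that no record appears in both sets, so the test data stays unseen during training.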

Target Variable

A label or numerical value we are trying to predict.

Transfer Learning

A practice in the field of machine learning of storing knowledge gained by solving one problem and applying it to a different but related problem, thereby reducing the need for additional training or compute. Transfer learning makes the development of machine learning more accessible and less resource intensive.

Resource:

New trends in transfer learning (2018): https://www.stateof.ai/

U

Unstructured Data

Information that either does not have a pre-defined data model or is not organized in a pre-defined manner. Unstructured information is typically text-heavy, but may contain data such as dates, numbers, and facts as well. Email is an example of unstructured data: while the busy inbox of a corporate human resources manager might be arranged by date, time, or size, if it were truly fully structured it would also be arranged by exact subject and content, with no deviation or spread. That is impractical, because people don’t generally speak about precisely one subject even in focused emails.

Resource:

Structured versus unstructured data: https://brightplanet.com/2012/06/structured-vs-unstructured-data/

Unsupervised learning

A type of machine learning algorithm used to draw inferences from datasets consisting of input data without labeled responses. The most common unsupervised learning method is cluster analysis, which is used for exploratory data analysis to find hidden patterns or groupings in data.

Resource:

Example methods of unsupervised learning: https://www.mathworks.com/discovery/unsupervised-learning.html

Underfitting

Instead of following the training data too closely (see Overfitting), a model that underfits ignores the lessons from the training data and fails to learn the underlying relationship between inputs and outputs.

Resource:

Conceptual explanation of overfit and underfit: https://towardsdatascience.com/overfitting-vs-underfitting-a-conceptual-explanation-d94ee20ca7f9

V

Variance

How much your model's test error changes based on variation in the training data. It reflects the model's sensitivity to the idiosyncrasies of the data set it was trained on. For example, an overfit model has high variance and low bias, whereas an underfit model has low variance and high bias.

Resource:

Conceptual explanation of overfit and underfit: https://towardsdatascience.com/overfitting-vs-underfitting-a-conceptual-explanation-d94ee20ca7f9
