Concept #143Mediumextended-ai-concepts

What is the difference between a foundational model and Gen AI?

#gen-ai#llm

Answer

Foundational Model vs Gen AI

These terms are related but describe different things: foundational model refers to the type of model architecture, while Gen AI describes the category of AI application.

Definitions

Foundation ModelGenerative AI
What it isLarge pre-trained model that can be adaptedAI that creates new content
FocusHow it's built and trainedWhat it does
RelationshipGen AI uses foundation modelsFoundation models power Gen AI
ExamplesGPT-4, Claude, Llama, BERTChatGPT, DALL-E, Midjourney

Foundation Model

A foundation model is a large-scale model trained on broad, diverse data using self-supervised learning — creating a general-purpose base that can be fine-tuned or prompted for many tasks.

Key characteristics:

  • Scale — billions of parameters
  • Broad training — web text, code, books, images (massive datasets)
  • General-purpose — not trained for one specific task
  • Adaptable — fine-tune or prompt-engineer for specific tasks
  • Emergent abilities — capabilities that weren't explicitly trained
python
from transformers import AutoModel, AutoTokenizer

# A foundation model used as-is or fine-tuned for specific tasks
base_model = AutoModel.from_pretrained("bert-base-uncased")  # Foundation model

# Fine-tune for specific downstream task
from transformers import AutoModelForSequenceClassification
classifier = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased",
    num_labels=2
)
# Now BERT (foundation) → sentiment classifier (specific task)

Not all foundation models are generative:

  • BERT — foundation model for NLP (encoder-only, NOT generative)
  • GPT-4 — foundation model AND generative
  • CLIP — vision-language foundation model (NOT generative)
  • ViT — vision transformer foundation model (NOT generative)

Generative AI

Generative AI describes AI that creates new content — text, images, audio, code, video.

Key characteristics:

  • Creates new content — not just classifying or retrieving
  • Outputs novel artifacts — text, images, music, code
  • Usually uses foundation models — but the focus is on the generation capability
python
# Gen AI in action — creating new content
from openai import OpenAI
client = OpenAI()

# Text generation (Gen AI using GPT foundation model)
text_response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Write a poem about AI"}]
)

# Image generation (Gen AI using DALL-E foundation model)
image_response = client.images.generate(
    model="dall-e-3",
    prompt="A futuristic city powered by AI",
    size="1024x1024"
)

The Relationship

text
Foundation Models (the infrastructure)
        ↓ power
Generative AI Applications (the use case)
        ↓ enable
User-facing products (ChatGPT, Midjourney, GitHub Copilot)

Venn Diagram

text
ā”Œā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”
│         Foundation Models       │
│  ā”Œā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”  │
│  │    Generative AI Models  │  │
│  │    (GPT, Claude, DALL-E) │  │
│  ā””ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”˜  │
│  BERT, CLIP, ViT               │
│  (not generative)              │
ā””ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”˜

Summary

All generative AI models used today are foundation models, but not all foundation models are generative. "Foundation model" describes architectural approach; "Gen AI" describes the output capability (creating new content).