Fine-Tuning: Customizing the Model Itself
Fine-tuning takes a pre-trained foundation model and trains it further on a specific dataset. The result is a model that retains its general capabilities but develops specialized expertise in a particular domain, task, or style.
The Analogy
A foundation model like GPT-4 or Claude is like a college graduate with a broad liberal arts education. They can write, analyze, and reason about many topics, but they are not specialists. Fine-tuning is like sending that graduate through a two-year specialized program — medical residency, legal clerkship, or financial analyst training. They come out the other side with the same general intelligence but dramatically better performance in their specialty.
What Fine-Tuning Actually Changes
When you fine-tune a model, you are adjusting the model's internal weights (parameters) based on new training examples. The model learns:
- Domain vocabulary and jargon: A model fine-tuned on legal documents will naturally use legal terminology correctly and understand its nuances.
- Task-specific patterns: A model fine-tuned on customer support conversations will learn the flow of troubleshooting dialogues.
- Style and tone: A model fine-tuned on your company's communications will match your brand voice.
- Output format preferences: A model fine-tuned on structured reports will naturally produce structured output.
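The weight adjustment described above can be sketched in miniature. This is an illustrative toy, not real LLM fine-tuning — a single-parameter model nudged by gradient descent toward the behavior demonstrated in new training examples, where a real fine-tuning run does the same thing across billions of weights:

```python
# Toy illustration of fine-tuning: gradient descent adjusts an existing
# weight toward the behavior shown by new training examples.
# (Illustrative only -- real fine-tuning updates billions of weights.)

def fine_tune(weight, examples, lr=0.1, epochs=50):
    """Adjust a single weight so that `weight * x` better matches `y`."""
    for _ in range(epochs):
        for x, y in examples:
            pred = weight * x
            grad = 2 * (pred - y) * x   # derivative of squared error
            weight -= lr * grad         # the "weight update" step
    return weight

# "Pre-trained" weight encodes a general behavior (doubling inputs).
pretrained = 2.0

# A specialized dataset demonstrates a different behavior (tripling).
specialized = [(1.0, 3.0), (2.0, 6.0), (3.0, 9.0)]

tuned = fine_tune(pretrained, specialized)
print(round(tuned, 3))  # converges to 3.0
```

The point of the toy: the model's capability is stored in its weights, and fine-tuning moves those weights toward whatever the new examples demonstrate — which is also why the quality of those examples matters so much.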
When Fine-Tuning Makes Sense
Fine-tuning is expensive and complex. It is not the first tool you should reach for. Here is a decision framework:
Fine-tune when:
- You need consistent, specialized behavior across thousands of interactions
- The base model consistently fails at your specific task even with excellent prompting
- You have a large, high-quality dataset of examples (hundreds to thousands of input-output pairs)
- The domain requires specialized vocabulary or reasoning that prompting alone cannot achieve
- You need to reduce costs at scale (a fine-tuned smaller model can outperform a larger general model on specific tasks, at lower per-token cost)
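The "high-quality dataset" criterion above usually means input-output pairs in JSONL format (one JSON record per line). The chat-style schema below follows a convention used by several hosted fine-tuning APIs, but the exact field names vary by provider — treat this as a hypothetical sketch and check your provider's documentation:

```python
import json

# Hypothetical training examples in the chat-style JSONL schema that
# several hosted fine-tuning APIs accept (field names vary by provider).
examples = [
    {"messages": [
        {"role": "system", "content": "You are a support agent for Acme."},
        {"role": "user", "content": "My router will not connect."},
        {"role": "assistant", "content": "Let's start by power-cycling the router."},
    ]},
]

# One JSON record per line -- the JSONL format.
with open("train.jsonl", "w") as f:
    for ex in examples:
        f.write(json.dumps(ex) + "\n")
```

Each record is one complete demonstration of the behavior you want; the "hundreds to thousands" guideline refers to the number of these records.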
Do NOT fine-tune when:
- Better prompting could solve the problem (try this first, always)
- You only have a few dozen examples (not enough data)
- Your needs change frequently (fine-tuning is slow and expensive to redo)
- You need the model to access current information (fine-tuning does not add knowledge — it adjusts behavior)
Cost Considerations
Fine-tuning costs include:
- Training compute: Running the training job on GPU infrastructure. For GPT-4 class models, this can cost hundreds to thousands of dollars per training run.
- Data preparation: Cleaning, formatting, and validating your training dataset. This is often the most time-consuming part.
- Iteration: You will likely need multiple training runs to get good results. Each run costs money and time.
- Hosting: If you fine-tune an open-source model, you need infrastructure to serve it. If you fine-tune through an API provider, you typically pay a premium per-token rate.
- Maintenance: A fine-tuned model is frozen at training time. The model itself does not degrade, but the world around it changes — as your products, policies, and domain language drift, its outputs fall out of date, and you may need to retrain periodically.
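The data-preparation item is where most projects stall, and a cheap validation pass pays for itself before any GPU time is spent. A minimal sketch, assuming chat-style JSONL records with a `messages` list (a common but provider-specific schema), that catches invalid JSON, missing fields, and exact duplicates:

```python
import json

def validate(path):
    """Check chat-style JSONL training data. Assumed (hypothetical) schema:
    each line is {"messages": [{"role": ..., "content": ...}, ...]}."""
    seen, errors = set(), []
    with open(path) as f:
        for i, line in enumerate(f, 1):
            try:
                record = json.loads(line)
            except json.JSONDecodeError:
                errors.append(f"line {i}: invalid JSON")
                continue
            msgs = record.get("messages", [])
            if not msgs:
                errors.append(f"line {i}: no messages")
            elif not all(m.get("role") and m.get("content") for m in msgs):
                errors.append(f"line {i}: message missing role or content")
            if line in seen:
                errors.append(f"line {i}: exact duplicate")
            seen.add(line)
    return errors
```

Running this before every training job is cheap insurance: a single malformed or duplicated record can silently waste an entire (expensive) training run.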
For most businesses, the total cost of a fine-tuning project ranges from $5,000 to $100,000+ depending on scale and complexity.
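The cost-at-scale argument reduces to simple break-even arithmetic. With hypothetical numbers (none taken from this document — substitute your own project cost and provider rates):

```python
# Hypothetical break-even calculation: when does a one-time fine-tuning
# cost pay for itself through a cheaper per-token serving rate?
# All numbers are illustrative.

fine_tune_cost = 20_000.0          # one-time project cost (USD)
base_rate = 10.0 / 1_000_000       # large general model: $10 per 1M tokens
tuned_rate = 2.0 / 1_000_000       # smaller fine-tuned model: $2 per 1M tokens

savings_per_token = base_rate - tuned_rate
break_even_tokens = fine_tune_cost / savings_per_token
print(f"{break_even_tokens:,.0f} tokens")  # 2,500,000,000 tokens
```

At high volume, 2.5 billion tokens can be a few months of traffic — which is why the fine-tuning math tends to work only for sustained, large-scale workloads, and rarely for a prototype.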