Robin Marillia

Fleet
Senior software engineer
The differences between prompt context, RAG, and fine-tuning and why we chose prompting

When integrating internal knowledge into AI applications, three main approaches stand out:

1. Prompt Context: Load all relevant information into the context window and leverage prompt caching.
2. Retrieval-Augmented Generation (RAG): Use text embeddings to fetch only the most relevant information for each query.
3. Fine-Tuning: Train a foundation model to better align with specific needs.
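To make the first two approaches concrete, here is a minimal sketch of how the prompt gets assembled in each case. Everything in it is an assumption for illustration: the sample documents are invented, the function names are hypothetical, and `toy_embed` is a bag-of-words stand-in for a real text-embedding model. Prompt caching happens on the provider side and fine-tuning changes the model rather than the prompt, so neither appears here.

```python
import math
from collections import Counter

# Hypothetical internal documents, invented for this example.
DOCS = [
    "Refunds are processed within five business days.",
    "The VPN requires two-factor authentication.",
    "Expense reports are due on the first Monday of each month.",
]


def build_full_context_prompt(question, docs):
    """Prompt context: place every document in front of the question."""
    knowledge = "\n".join(f"- {d}" for d in docs)
    return f"Use the knowledge below to answer.\n{knowledge}\n\nQuestion: {question}"


def toy_embed(text):
    """Stand-in for an embedding model: a bag-of-words count vector."""
    cleaned = text.lower().replace(".", "").replace("?", "")
    return Counter(cleaned.split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_rag_prompt(question, docs, k=1):
    """RAG: keep only the k documents most similar to the question."""
    q_vec = toy_embed(question)
    ranked = sorted(docs, key=lambda d: cosine(q_vec, toy_embed(d)), reverse=True)
    return build_full_context_prompt(question, ranked[:k])


question = "When are expense reports due?"
full_prompt = build_full_context_prompt(question, DOCS)
rag_prompt = build_rag_prompt(question, DOCS, k=1)
```

With these sample documents, `full_prompt` carries all three entries, while `rag_prompt` carries only the expense-report entry. A real RAG setup would replace `toy_embed` with an embedding model and usually a vector index, which is where most of the extra moving parts come from.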

Each approach has its own strengths and trade-offs: