Blog

Get the latest news on building great LLM-powered apps with private data. We deep dive on LLM Ops, RAG, hallucinations and more
Tutorials

Breaking Down OpenAI o1 Model for RAG Engineers

OpenAI's o1 reasoning model: A step towards AI that thinks, promising improved rule-following and consistency, but not without skepticism and limitations.

Read Article
Research

Do Vector Databases Lose Accuracy at Scale?

Research shows vector databases lose accuracy at just 10,000 pages, but there's a way out.

Read Article
Tutorials

Fine-Tuning AI Models: A RAG Developer's Guide to Specialized Machine Learning

Dive into the world of AI fine-tuning, from basic concepts to cutting-edge techniques like LoRA. Learn how to specialize your models, avoid common pitfalls, and leverage fine-tuning for real-world applications.

Read Article
Tutorials

Optimizing RAG Systems with Advanced LLM Routing Techniques: A Deep Dive

LLM routing is transforming AI system architecture by intelligently directing prompts to the most suitable language models, balancing quality, speed, and cost for optimal performance.

Read Article
Tutorials

How AskVet is Transforming Veterinary Care with AI and RAG Technology

Learn how AskVet is using AI and Retrieval Augmented Generation (RAG) to revolutionize pet care. Discover their unique mood bucket and sentiment system and learn tips for developers to implement RAG and other AI solutions across industries like AI in veterinary services.

Read Article
Tutorials

Unlocking the Power of Agentic RAG: The Next Step for AI Problem-Solving

As AI continues to evolve, two technologies are converging to create a powerful new approach: Agentic RAG. Agentic RAG combines techniques from Retrieval Augmented Generation (RAG) with AI Agents (semi-autonomous AI) to push the boundaries of AI problem-solving.

Read Article
Tutorials

The AI Engineer's Guide to Document Parsing in RAG Applications

Understanding and optimizing your parsing strategy is one of the keys to building high-performance RAG applications. There are several popular parsing strategies and tools out there, each with their own strengths and limitations.

Read Article
Tutorials

Multimodal RAG Explained: Integrating Text, Images, Audio, and More in AI

Multimodal Retrieval-Augmented Generation (RAG) has emerged as a unique approach to increase efficiency and reliability of AI systems. This concept extends traditional text-based RAG systems to incorporate various data types such as images, audio, and video, creating richer and more contextually accurate information retrieval and generation.

Read Article
Tutorials

Understanding CRAG: Meta's Comprehensive Benchmark for Retrieval Augmented Generation

CRAG, or the Comprehensive RAG Benchmark, is Meta’s newest benchmark to evaluate AI performance. We break the latest benchmark down and evaluate its importance for AI engineers.

Read Article
Research

World's Most Accurate RAG? Langchain/Pinecone, LlamaIndex and EyeLevel Duke it Out

Winner achieved 98% accuracy across 1,000+ pages of complex documents

Read Article

Find out what the buzz is about. Learn to build AI you can trust.

Find out what the buzz is about. Learn to build AI you can trust.