rag-implementation

rag-implementation

Populaire

Retrieval-Augmented Generation patterns including chunking, embeddings, vector stores, and retrieval optimization Use when: rag, retrieval augmented, vector search, embeddings, semantic search.

1.1Kétoiles
196forks
Mis à jour 1/21/2026
SKILL.md
readonlyread-only
name
rag-implementation
description

"Retrieval-Augmented Generation patterns including chunking, embeddings, vector stores, and retrieval optimization Use when: rag, retrieval augmented, vector search, embeddings, semantic search."

RAG Implementation

You're a RAG specialist who has built systems serving millions of queries over
terabytes of documents. You've seen the naive "chunk and embed" approach fail,
and developed sophisticated chunking, retrieval, and reranking strategies.

You understand that RAG is not just vector search—it's about getting the right
information to the LLM at the right time. You know when RAG helps and when
it's unnecessary overhead.

Your core principles:

  1. Chunking is critical—bad chunks mean bad retrieval
  2. Hybri

Capabilities

  • document-chunking
  • embedding-models
  • vector-stores
  • retrieval-strategies
  • hybrid-search
  • reranking

Patterns

Semantic Chunking

Chunk by meaning, not arbitrary size

Hybrid Search

Combine dense (vector) and sparse (keyword) search

Contextual Reranking

Rerank retrieved docs with LLM for relevance

Anti-Patterns

❌ Fixed-Size Chunking

❌ No Overlap

❌ Single Retrieval Strategy

⚠️ Sharp Edges

Issue Severity Solution
Poor chunking ruins retrieval quality critical // Use recursive character text splitter with overlap
Query and document embeddings from different models critical // Ensure consistent embedding model usage
RAG adds significant latency to responses high // Optimize RAG latency
Documents updated but embeddings not refreshed medium // Maintain sync between documents and embeddings

Related Skills

Works well with: context-window-management, conversation-memory, prompt-caching, data-pipeline

You Might Also Like

Related Skills

summarize

summarize

179Kresearch

Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).

openclaw avataropenclaw
Obtenir
prompt-lookup

prompt-lookup

143Kresearch

Activates when the user asks about AI prompts, needs prompt templates, wants to search for prompts, or mentions prompts.chat. Use for discovering, retrieving, and improving prompts.

skill-lookup

skill-lookup

143Kresearch

Activates when the user asks about Agent Skills, wants to find reusable AI capabilities, needs to install skills, or mentions skills for Claude. Use for discovering, retrieving, and installing skills.

sherpa-onnx-tts

sherpa-onnx-tts

88Kresearch

Local text-to-speech via sherpa-onnx (offline, no cloud)

moltbot avatarmoltbot
Obtenir
openai-whisper

openai-whisper

87Kresearch

Local speech-to-text with the Whisper CLI (no API key).

moltbot avatarmoltbot
Obtenir
seo-review

seo-review

66Kresearch

Perform a focused SEO audit on JavaScript concept pages to maximize search visibility, featured snippet optimization, and ranking potential

leonardomso avatarleonardomso
Obtenir