chat-with-arxiv

chat-with-arxiv

Build interactive chat agents for exploring and discussing academic research papers from ArXiv. Covers paper retrieval, content processing, question-answering, and research synthesis. Use when building research assistants, paper summarization tools, academic knowledge bases, or scientific literature chatbots.

1スター
0フォーク
更新日 1/15/2026
SKILL.md
readonlyread-only
name
chat-with-arxiv
description

Build interactive chat agents for exploring and discussing academic research papers from ArXiv. Covers paper retrieval, content processing, question-answering, and research synthesis. Use when building research assistants, paper summarization tools, academic knowledge bases, or scientific literature chatbots.

Chat with ArXiv

Build intelligent agents that understand, discuss, and synthesize academic research papers from ArXiv, enabling conversational exploration of scientific literature.

Overview

ArXiv chat agents combine:

  • Paper Discovery: Search and retrieve relevant research
  • Content Processing: Extract and understand paper content
  • Question Answering: Answer questions about papers
  • Research Synthesis: Identify connections between papers
  • Conversational Interface: Natural discussion about research

Applications

  • Research assistant for literature review
  • Paper summarization and explanation
  • Topic exploration across multiple papers
  • Citation analysis and connection finding
  • Trend identification in research areas
  • Thesis and dissertation support

Architecture

User Query
    ↓
Query Classifier (Paper Search vs Q&A)
    ├→ Paper Search
    │  ├ Query ArXiv API
    │  ├ Retrieve papers
    │  └ Process metadata
    │
    ├→ Question Answering
    │  ├ Retrieve relevant papers
    │  ├ Extract relevant sections
    │  ├ Generate answer with LLM
    │  └ Cite sources
    │
    └→ Conversational Analysis
       ├ Analyze paper relationships
       ├ Identify themes
       └ Synthesize findings
    ↓
Response with Citations

Paper Discovery and Retrieval

1. ArXiv API Integration

See examples/arxiv_paper_retriever.py for ArXivPaperRetriever:

  • Search papers by query with relevance ranking
  • Search by category, author, or title keywords
  • Retrieve trending papers by category and date range
  • Find similar papers to a given paper
  • Extract key terms from paper abstracts

2. Paper Content Processing

See examples/paper_content_processor.py for PaperContentProcessor:

  • Download and extract PDF content
  • Parse paper structure (abstract, introduction, methodology, results, conclusion, references)
  • Extract citations from papers
  • Cache processed papers for performance
  • Chunk papers for RAG integration

Question Answering System

1. RAG-Based QA

See examples/paper_question_answerer.py for PaperQuestionAnswerer:

  • Search for relevant papers from ArXiv
  • Download and process papers
  • Chunk papers for RAG retrieval
  • Retrieve most relevant chunks using embeddings
  • Generate answers with proper citations

2. Multi-Paper Synthesis

Build synthesis capabilities to:

  • Analyze multiple papers on a topic
  • Extract key findings and conclusions
  • Identify common research themes
  • Generate comprehensive synthesis of research area

Conversational Interface

1. Multi-Turn Conversation

See examples/arxiv_chatbot.py for ArXivChatbot:

  • Maintain conversation history
  • Classify query types (single paper Q&A, multi-paper synthesis, trends, general)
  • Handle single paper questions with citations
  • Handle synthesis queries across multiple papers
  • Detect and retrieve research trends
  • Generate contextual responses

2. Context Management

Build context management to:

  • Track current discussion topic
  • Remember discussed papers
  • Find related papers in conversation
  • Summarize discussion progress

Best Practices

Paper Retrieval

  • ✓ Use specific queries for better results
  • ✓ Limit results to relevant papers (max 50-100)
  • ✓ Cache downloaded papers locally
  • ✓ Handle API rate limits
  • ✓ Validate PDF extraction

Question Answering

  • ✓ Always cite sources with ArXiv IDs
  • ✓ Use multiple paper perspectives
  • ✓ Acknowledge uncertainties
  • ✓ Highlight conflicting findings
  • ✓ Suggest related papers

Conversation Management

  • ✓ Maintain conversation history
  • ✓ Track discussed papers
  • ✓ Clarify ambiguous queries
  • ✓ Suggest follow-up questions
  • ✓ Provide paper recommendations

Implementation Checklist

  • [ ] Set up ArXiv API client
  • [ ] Implement paper retrieval
  • [ ] Create PDF processing pipeline
  • [ ] Build RAG system for QA
  • [ ] Implement multi-paper synthesis
  • [ ] Create conversational interface
  • [ ] Add search filtering
  • [ ] Set up caching system
  • [ ] Implement citation formatting
  • [ ] Add error handling and logging
  • [ ] Test across research areas

Resources

ArXiv API

Paper Processing

RAG and QA

Citation Management

You Might Also Like

Related Skills

verify

verify

243K

Use when you want to validate changes before committing, or when you need to check all React contribution requirements.

facebook avatarfacebook
入手
test

test

243K

Use when you need to run tests for React core. Supports source, www, stable, and experimental channels.

facebook avatarfacebook
入手

Use when feature flag tests fail, flags need updating, understanding @gate pragmas, debugging channel-specific test failures, or adding new flags to React.

facebook avatarfacebook
入手

Use when adding new error messages to React, or seeing "unknown error code" warnings.

facebook avatarfacebook
入手
flow

flow

243K

Use when you need to run Flow type checking, or when seeing Flow type errors in React code.

facebook avatarfacebook
入手
flags

flags

243K

Use when you need to check feature flag states, compare channels, or debug why a feature behaves differently across release channels.

facebook avatarfacebook
入手