firecrawl-parse

Name: firecrawl-parse
Author: firecrawl

firecrawl parse

Turn a local document into clean markdown on disk. Supports PDF, DOCX, DOC, ODT, RTF, XLSX, XLS, HTML/HTM/XHTML.

When to use

You have a file on disk (not a URL) and want its text as markdown
User drops a PDF/DOCX and asks what it says, or to summarize it
Use scrape instead when the source is a URL

Quick start

Always save to .firecrawl/ with -o — parsed docs can be hundreds of KB and blow up context if streamed to stdout. Add .firecrawl/ to .gitignore.

mkdir -p .firecrawl

# File → markdown
firecrawl parse ./paper.pdf -o .firecrawl/paper.md

# AI summary
firecrawl parse ./paper.pdf -S -o .firecrawl/paper-summary.md

# Ask a question about the doc
firecrawl parse ./paper.pdf -Q "What are the main conclusions?" \
  -o .firecrawl/paper-qa.md

Then head, grep, rg etc., or incrementally read the file - don't load the whole thing at once.

Options

Option	Description
`-S, --summary`	AI-generated summary
`-Q, --query <prompt>`	Ask a question about the parsed content
`-o, --output <path>`	Output file path — always use this
`-f, --format <fmt>`	`markdown` (default), `html`, `summary`
`--timeout <ms>`	Timeout for the parse job
`--timing`	Show request duration

Tips

Quote paths with spaces: firecrawl parse "./My Doc.pdf" -o .firecrawl/mydoc.md.
Max upload size: 50 MB per file.
Credits: ~1 per PDF page; HTML is 1 flat.
Check .firecrawl/ before re-parsing the same file.
To check your credit balance (recommended for batch processing and similar workflows), use the firecrawl credit-usage command.

Related Skills

summarize

380Kresearch-knowledge

Summarize or transcribe URLs, YouTube/videos, podcasts, articles, transcripts, PDFs, and local files.

steipete

获取

writing-skills

233Kresearch-knowledge

Use when creating new skills, editing existing skills, or verifying skills work before deployment

obra

获取

Guide users through a structured workflow for co-authoring documentation. Use when user wants to write documentation, proposals, technical specs, decision docs, or similar structured content. This workflow helps users efficiently transfer context, refine content through iteration, and verify the doc works for readers. Trigger when user mentions writing docs, creating proposals, drafting specs, or similar documentation tasks.

anthropics

获取

claude-api

153Kresearch-knowledge

anthropics

获取

mcp-builder

153Kresearch-knowledge

Guide for creating high-quality MCP (Model Context Protocol) servers that enable LLMs to interact with external services through well-designed tools. Use when building MCP servers to integrate external APIs or services, whether in Python (FastMCP) or Node/TypeScript (MCP SDK).

anthropics

获取

xlsx

152Kresearch-knowledge

Use this skill any time a spreadsheet file is the primary input or output. This means any task where the user wants to: open, read, edit, or fix an existing .xlsx, .xlsm, .csv, or .tsv file (e.g., adding columns, computing formulas, formatting, charting, cleaning messy data); create a new spreadsheet from scratch or from other data sources; or convert between tabular file formats. Trigger especially when the user references a spreadsheet file by name or path — even casually (like \"the xlsx in my downloads\") — and wants something done to it or produced from it. Also trigger for cleaning or restructuring messy tabular data files (malformed rows, misplaced headers, junk data) into proper spreadsheets. The deliverable must be a spreadsheet file. Do NOT trigger when the primary deliverable is a Word document, HTML report, standalone Python script, database pipeline, or Google Sheets API integration, even if tabular data is involved.

anthropics

获取

firecrawl-parse

firecrawl parse

When to use

Quick start

Options

Tips

See also

You Might Also Like

Related Skills

summarize

writing-skills

doc-coauthoring

claude-api

mcp-builder

xlsx