
firecrawl-company-directories
Extract structured company lists from directories with Firecrawl. Use for scraping YC, Crunchbase, Product Hunt, G2, startup directories, category directories, or custom company databases into JSON, CSV, CRM-ready lists, or research tables.
Requires a Firecrawl API key for hosted Firecrawl requests.
Firecrawl Company Directories
Use this to turn startup or company directories into structured lists.
Onboarding Interview
Infer the directory, filters, result count, and output format from context. If the source is clear, proceed immediately.
Ask no more than three concise questions, and only if blocked, for example on the directory URL/name, required filters, or target result count.
Firecrawl Collection Plan
Use Firecrawl browser when the directory needs filters, pagination, infinite scroll, or profile clicks. Use scrape/map when listings are public and static.
Suggested sources include YC companies, Crunchbase, Product Hunt, G2 categories, or any custom directory URL.
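The browser versus scrape/map choice above can be sketched as a small decision helper. This is only an illustration: `DirectoryTraits` and `choose_collection_mode` are hypothetical names, not part of Firecrawl, and the returned strings are just labels for the two approaches described in this section.

```python
from dataclasses import dataclass


@dataclass
class DirectoryTraits:
    """Hypothetical description of how a directory behaves."""
    needs_filters: bool = False
    paginated: bool = False
    infinite_scroll: bool = False
    needs_profile_clicks: bool = False


def choose_collection_mode(traits: DirectoryTraits) -> str:
    """Return "browser" for interactive directories, else "scrape/map"."""
    interactive = (
        traits.needs_filters
        or traits.paginated
        or traits.infinite_scroll
        or traits.needs_profile_clicks
    )
    return "browser" if interactive else "scrape/map"
```

A public, static listing page falls through to `"scrape/map"`; any single interactive trait is enough to switch to `"browser"`.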
Extraction Fields
Capture fields that are visible:
- name
- description
- industry/category
- stage/founded/location/team size/funding when visible
- tags
- directory profile URL
- company website URL
Leave unavailable fields blank. Do not infer.
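A minimal sketch of the "leave blank, do not infer" rule, assuming each scraped listing arrives as a plain dict. The function name `normalize_company` is hypothetical, and the key names are borrowed from the JSON Shape section of this document.

```python
# Keys follow the JSON Shape section; anything not scraped stays blank.
FIELDS = (
    "name", "description", "industry", "stage", "founded", "location",
    "teamSize", "funding", "tags", "profileUrl", "websiteUrl",
)


def normalize_company(raw: dict) -> dict:
    """Copy only visible values; never guess a missing one."""
    record = {}
    for field in FIELDS:
        value = raw.get(field)
        if field == "tags":
            # Tags are kept as a list, dropping empty or whitespace-only entries.
            record[field] = [t.strip() for t in (value or []) if t and t.strip()]
        elif value is None:
            record[field] = ""  # unavailable on the page: leave blank
        else:
            record[field] = str(value).strip()
    return record
```

Keys outside `FIELDS` are dropped, so stray page data does not leak into the export.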
Final Deliverable
# Company Directory Export: [Source]
## Summary
[Filters, count extracted, limitations]
## Companies
[Table or link to JSON/CSV]
## Sources
[Directory pages and profiles used]
## Rerun Inputs
workflow: firecrawl-company-directories
directory: [source]
filters: [criteria]
max_results: [number]
output: [json/csv/markdown]
JSON Shape
Use source, filters, extractedAt, totalResults, and companies[] with name, url, description, industry, stage, founded, location, teamSize, funding, tags, profileUrl, and websiteUrl.
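One way to assemble that shape, as a sketch rather than a fixed schema. The helper `build_export` is hypothetical, the sample company is made-up placeholder data, and `totalResults` is assumed here to mean the number of companies extracted.

```python
import json
from datetime import datetime, timezone

# Placeholder record showing every company key; values are invented.
EXAMPLE_COMPANY = {
    "name": "Example Co",
    "url": "https://example.com",
    "description": "Placeholder description.",
    "industry": "",
    "stage": "",
    "founded": "",
    "location": "",
    "teamSize": "",
    "funding": "",
    "tags": [],
    "profileUrl": "https://example.com/directory/example-co",
    "websiteUrl": "https://example.com",
}


def build_export(source: str, filters: dict, companies: list) -> dict:
    """Wrap extracted companies in the top-level export envelope."""
    return {
        "source": source,
        "filters": filters,
        "extractedAt": datetime.now(timezone.utc).isoformat(),
        "totalResults": len(companies),  # assumption: count of extracted rows
        "companies": companies,
    }


if __name__ == "__main__":
    export = build_export("example-directory", {"category": "fintech"}, [EXAMPLE_COMPANY])
    print(json.dumps(export, indent=2))
```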
Quality Bar
- Deduplicate companies.
- Track pagination progress.
- Note rate limits, login walls, or CAPTCHA blocks.
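The checks above could be sketched as follows. All names here (`dedupe_key`, `deduplicate`, `CrawlProgress`) are hypothetical, and keying on the website host with a fallback to the normalized name is an assumed heuristic, not a rule from this workflow; it can merge distinct companies that share a host.

```python
from dataclasses import dataclass, field
from urllib.parse import urlparse


def dedupe_key(company: dict) -> str:
    """Prefer the website host as identity; fall back to the normalized name."""
    website = (company.get("websiteUrl") or "").strip().lower()
    if website:
        # Add "//" so bare hosts like "acme.test" parse into netloc.
        parsed = urlparse(website if "://" in website else "//" + website)
        host = parsed.netloc
        if host.startswith("www."):
            host = host[4:]
        if host:
            return "site:" + host
    return "name:" + " ".join((company.get("name") or "").lower().split())


def deduplicate(companies: list) -> list:
    """Keep the first occurrence of each company, preserving order."""
    seen, unique = set(), []
    for company in companies:
        key = dedupe_key(company)
        if key not in seen:
            seen.add(key)
            unique.append(company)
    return unique


@dataclass
class CrawlProgress:
    """Tracks pages processed and any blockers to report in the summary."""
    pages_done: int = 0
    collected: int = 0
    blockers: list = field(default_factory=list)

    def record_page(self, new_items: int) -> None:
        self.pages_done += 1
        self.collected += new_items

    def note_blocker(self, kind: str, detail: str) -> None:
        # kind is a free-form label such as "rate_limit", "login_wall", "captcha"
        self.blockers.append((kind, detail))
```

The `blockers` list is meant to feed the limitations line in the Summary of the final deliverable.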
