Test

Testing, debugging

580 skills available

Showing 24 results

find-bugs

18K · dev-testing

Find bugs, security vulnerabilities, and code quality issues in local branch changes. Use when asked to review changes, find bugs, security review, or audit code on the current branch.

davila7

evaluating-code-models

Evaluates code generation models across HumanEval, MBPP, MultiPL-E, and 15+ benchmarks with pass@k metrics. Use when benchmarking code models, comparing coding abilities, testing multi-language support, or measuring code generation quality. Industry standard from the BigCode Project, used by HuggingFace leaderboards.

davila7

evaluating-llms-harness

18K · dev-testing

Evaluates LLMs across 60+ academic benchmarks (MMLU, HumanEval, GSM8K, TruthfulQA, HellaSwag). Use when benchmarking model quality, comparing models, reporting academic results, or tracking training progress. Industry standard used by EleutherAI, HuggingFace, and major labs. Supports HuggingFace, vLLM, APIs.

davila7

langsmith-observability

18K · dev-testing

LLM observability platform for tracing, evaluation, and monitoring. Use when debugging LLM applications, evaluating model outputs against datasets, monitoring production systems, or building systematic testing pipelines for AI applications.

davila7

senior-qa

18K · dev-testing

Comprehensive QA and testing skill for quality assurance, test automation, and testing strategies for ReactJS, NextJS, NodeJS applications. Includes test suite generation, coverage analysis, E2E testing setup, and quality metrics. Use when designing test strategies, writing test cases, implementing test automation, performing manual testing, or analyzing test coverage.

davila7

prowler-test-ui

13K · dev-testing

E2E testing patterns for Prowler UI (Playwright). Trigger: When writing Playwright E2E tests under ui/tests in the Prowler UI (Prowler-specific base page/helpers, tags, flows).

prowler-cloud

prowler-test-api

13K · dev-testing

Testing patterns for Prowler API: JSON:API, Celery tasks, RLS isolation, RBAC. Trigger: When writing tests for api/ (JSON:API requests/assertions, cross-tenant isolation, RBAC, Celery tasks, viewsets/serializers).

prowler-cloud

playwright

13K · dev-testing

Playwright E2E testing patterns. Trigger: When writing Playwright E2E tests (Page Object Model, selectors, MCP exploration workflow). For Prowler-specific UI conventions under ui/tests, also use prowler-test-ui.

prowler-cloud

prowler-test-sdk

13K · dev-testing

Testing patterns for Prowler SDK (Python). Trigger: When writing tests for the Prowler SDK (checks/services/providers), including provider-specific mocking rules (moto for AWS only).

prowler-cloud

pytest

13K · dev-testing

Pytest testing patterns for Python. Trigger: When writing or refactoring pytest tests (fixtures, mocking, parametrize, markers). For Prowler-specific API/SDK testing conventions, also use prowler-test-api or prowler-test-sdk.

prowler-cloud

agent-browser

13K · dev-testing

Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.

vercel-labs

gpui-test

9.9K · dev-testing

Writing tests for GPUI applications. Use when testing components, async operations, or UI behavior.

longbridge

context-degradation

7.6K · dev-testing

This skill should be used when the user asks to "diagnose context problems", "fix lost-in-middle issues", "debug agent failures", "understand context poisoning", or mentions context degradation, attention patterns, context clash, context confusion, or agent performance degradation. Provides patterns for recognizing and mitigating context failures.

muratcankoylan

gh-fix-ci

7.2K · dev-testing

Use when a user asks to debug or fix failing GitHub PR checks that run in GitHub Actions; use `gh` to inspect checks and logs, summarize failure context, draft a fix plan, and implement only after explicit approval. Treat external providers (for example Buildkite) as out of scope and report only the details URL.

openai

agent-review

7K · dev-testing

Get external agent review and feedback. Routes Anthropic models through the Claude Agent SDK (uses the local subscription) and other models through the OpenRouter API. Use for code review, architecture feedback, or any external consultation.

civitai

my-first-skill

6.9K · dev-testing

Example skill demonstrating the Anthropic SKILL.md format. Load when learning to create skills or testing the OpenSkills loader.

numman-ali

adaptyv

6.8K · dev-testing

Cloud laboratory platform for automated protein testing and validation. Use when designing proteins and needing experimental validation including binding assays, expression testing, thermostability measurements, enzyme activity assays, or protein sequence optimization. Also use for submitting experiments via API, tracking experiment status, downloading results, optimizing protein sequences for better expression using computational tools (NetSolP, SoluProt, SolubleMPNN, ESM), or managing protein design workflows with wet-lab validation.

K-Dense-AI

seo-audit

5.1K · dev-testing

Use when the user wants to audit, review, or diagnose SEO issues on their site. Also use when the user mentions "SEO audit," "technical SEO," "why am I not ranking," "SEO issues," "on-page SEO," "meta tags review," or "SEO health check." For building pages at scale to target keywords, see programmatic-seo. For adding structured data, see schema-markup.

coreyhaines31

honesty

3.9K · dev-testing

Behavioral guideline for providing brutally honest feedback. Always use this skill: it defines core interaction expectations for code review and technical discussions.

EpicenterHQ

rust-errors

3.9K · dev-testing

Rust-to-TypeScript error handling patterns for Tauri apps. Use when defining Rust errors that will be passed to TypeScript, handling Tauri command errors, or creating discriminated union error types.

EpicenterHQ

dead-code

3.4K · dev-testing

Find unused functions and dead code in the codebase.

parcadei

debug-hooks

3.4K · dev-testing

Systematic hook debugging workflow. Use when hooks aren't firing, producing wrong output, or behaving unexpectedly.

parcadei

dev-browser

3.1K · dev-testing

Browser automation with persistent page state. Use when users ask to navigate websites, fill forms, take screenshots, extract web data, test web apps, or automate browser workflows. Trigger phrases include "go to [url]", "click on", "fill out the form", "take a screenshot", "scrape", "automate", "test the website", "log into", or any browser interaction request.

SawyerHood

playwriter

2.7K · dev-testing

Control the user's own Chrome browser via the Playwriter extension, using Playwright code snippets in a stateful local JS sandbox through the playwriter CLI. Automate web interactions, take screenshots, inspect accessibility trees, and debug and profile web applications. Run the `playwriter skill` command to read the complete, up-to-date skill.

remorses

Skills List

find-bugs

evaluating-code-models

evaluating-llms-harness

langsmith-observability

senior-qa

prowler-test-ui

prowler-test-api

playwright

prowler-test-sdk

pytest

agent-browser

gpui-test

context-degradation

gh-fix-ci

agent-review

my-first-skill

adaptyv

seo-audit

honesty

rust-errors

dead-code

debug-hooks

dev-browser

playwriter