data-validation

Security pattern for input validation and sanitization. Use when implementing input handling, preventing injection attacks (SQL, XSS, command), ensuring data integrity, or processing data from untrusted sources. Addresses "Entity provides unexpected data" problem.

2Star

1Fork

更新于 1/22/2026

获取 Skill 源代码

SKILL.md

readonly只读

name

data-validation

description

Data Validation Security Pattern

Ensures all incoming data is validated against specifications before processing, preventing injection attacks, data corruption, and unexpected behavior.

Problem Addressed

Entity provides unexpected data: Malicious or malformed input causes:

Injection attacks (SQL, XSS, command injection)
System crashes or unexpected behavior
Data corruption
Security bypasses

Core Components

Role	Type	Responsibility
Entity	Entity	Sends data to system
Enforcer	Enforcement Point	Intercepts all incoming data
Validator	Decision Point	Validates data against specification
Specification Provider	Information Point	Manages validation rules
System	Entity	Processes validated data

Data Elements

data: Input from entity (raw)
canonical_data: Normalized, validated form
specification: Rules defining valid data
type: Identifier for applicable specification
error: Validation failure message

Validation Flow

Entity → [data] → Enforcer
Enforcer → [data] → Validator
Validator → [get_specification(type)] → Specification Provider
Specification Provider → [specification] → Validator
Validator → [validate, transform to canonical] → Validator
Validator → [canonical_data or error] → Enforcer
Enforcer → [canonical_data] → System (if valid)
        → [error] → Entity (if invalid)

Enforcer intercepts ALL incoming data
Validator retrieves appropriate specification
Validator transforms to canonical form
Validator checks against specification
Valid: canonical data forwarded to System
Invalid: error returned to Entity

Validation Principles

Validate Everything

All data from uncontrolled sources
Parameters, headers, cookies, files
Data from APIs, databases (defense in depth)

Canonical Form

Transform data to standardized form:

Remove/escape special characters
Decode encoded values
Normalize Unicode
Parse structured data to typed objects

Benefit: System only processes data in known format.

Allowlist vs. Blocklist

Allowlist (preferred): Define what IS allowed
Blocklist (risky): Define what is NOT allowed

Blocklists fail against unknown attack patterns. Use allowlists.

Validate Early, Validate Often

Validate at system boundary (earliest point)
Re-validate near code that relies on data
Defense in depth

Validation Types

Type Validation

Ensure data matches expected type
Integer, string, boolean, date, email, URL

Range/Length Validation

Numeric bounds
String length limits
Array size limits

Format Validation

Regular expressions (carefully!)
Structural patterns
Protocol conformance

Business Logic Validation

Application-specific rules
Cross-field validation
State-dependent validation

Security Considerations

Validation ≠ Authorization

Validation: Is this data well-formed?
Authorization: Is entity allowed to use this data?

Both are required. Valid data doesn't mean authorized access.

Error Messages

Don't reveal validation internals to attackers
Log detailed errors server-side
Return generic errors to clients

Encoding Output

Validation alone doesn't prevent all injection:

Still encode output for context (HTML, SQL, etc.)
Use parameterized queries
Use context-appropriate escaping

File Uploads

Special validation needed:

Verify content type (not just extension)
Scan for malware
Restrict file sizes
Store outside web root

Structured Data (JSON, XML)

Parse with secure parser
Disable external entity processing (XXE)
Validate against schema
Limit nesting depth

Regular Expression Safety

Avoid ReDoS-vulnerable patterns
Limit input length before regex
Test regex performance with malicious input

Common Validation Scenarios

Input Type	Validations
Username	Length, allowed characters, no control chars
Email	Format, length, allowlist domains (if applicable)
Integer	Type, range, positive/negative
URL	Protocol allowlist, format, no javascript:
File	Extension, content-type, size, malware scan
JSON	Schema validation, depth limits, size limits

Implementation Checklist

[ ] All entry points have validation
[ ] Canonical form transformation
[ ] Allowlist-based rules
[ ] Type checking
[ ] Length/range limits
[ ] Business rule validation
[ ] Secure error handling
[ ] Output encoding (separate from validation)
[ ] File upload validation
[ ] Structured data parsing safely
[ ] Re-validation near sensitive operations

Related Patterns

Authorisation (validation doesn't replace authorization)
Selective encrypted transmission (protect data in transit)
Log entity actions (log validation failures)

References

Source: https://securitypatterns.distrinet-research.be/patterns/04_01_001__data_validation/
OWASP Input Validation Cheat Sheet
OWASP XSS Prevention Cheat Sheet

Related Skills

create-pr

170Kdev-devops

Creates GitHub pull requests with properly formatted titles that pass the check-pr-title CI validation. Use when creating PRs, submitting changes for review, or when the user says /pr or asks to create a pull request.

n8n-io

获取

electron-chromium-upgrade

120Kdev-devops

Guide for performing Chromium version upgrades in the Electron project. Use when working on the roller/chromium/main branch to fix patch conflicts during `e sync --3`. Covers the patch application workflow, conflict resolution, analyzing upstream Chromium changes, and proper commit formatting for patch fixes.

electron

获取

pr-creator

92Kdev-devops

Use this skill when asked to create a pull request (PR). It ensures all PRs follow the repository's established templates and standards.

google-gemini

获取

clawdhub

87Kdev-devops

Use the ClawdHub CLI to search, install, update, and publish agent skills from clawdhub.com. Use when you need to fetch new skills on the fly, sync installed skills to latest or a specific version, or publish new/updated skill folders with the npm-installed clawdhub CLI.

moltbot

获取

tmux

87Kdev-devops

Remote-control tmux sessions for interactive CLIs by sending keystrokes and scraping pane output.

moltbot

获取

create-pull-request

57Kdev-devops

Create a GitHub pull request following project conventions. Use when the user asks to create a PR, submit changes for review, or open a pull request. Handles commit analysis, branch management, and PR creation using the gh CLI tool.

cline

获取

data-validation

Data Validation Security Pattern

Problem Addressed

Core Components

Data Elements

Validation Flow

Validation Principles

Validate Everything

Canonical Form

Allowlist vs. Blocklist

Validate Early, Validate Often

Validation Types

Type Validation

Range/Length Validation

Format Validation

Business Logic Validation

Security Considerations

Validation ≠ Authorization

Error Messages

Encoding Output

File Uploads

Structured Data (JSON, XML)

Regular Expression Safety

Common Validation Scenarios

Implementation Checklist

Related Patterns

References

You Might Also Like

Related Skills

create-pr

electron-chromium-upgrade

pr-creator

clawdhub

tmux

create-pull-request