LLM Tuning Patterns

Evidence-based patterns for configuring LLM parameters, based on APOLLO and Godel-Prover research.

Pattern

Different tasks require different LLM configurations. Use these evidence-based settings.

Theorem Proving / Formal Reasoning

Based on APOLLO parity analysis:

Parameter	Value	Rationale
max_tokens	4096	Proofs need space for chain-of-thought
temperature	0.6	Higher creativity for tactic exploration
top_p	0.95	Allow diverse proof paths

Proof Plan Prompt

Always request a proof plan before tactics:

Given the theorem to prove:
[theorem statement]

First, write a high-level proof plan explaining your approach.
Then, suggest Lean 4 tactics to implement each step.

The proof plan (chain-of-thought) significantly improves tactic quality.

Parallel Sampling

For hard proofs, use parallel sampling:

Generate N=8-32 candidate proof attempts
Use best-of-N selection
Each sample at temperature 0.6-0.8

Code Generation

Parameter	Value	Rationale
max_tokens	2048	Sufficient for most functions
temperature	0.2-0.4	Prefer deterministic output

Creative / Exploration Tasks

Parameter	Value	Rationale
max_tokens	4096	Space for exploration
temperature	0.8-1.0	Maximum creativity

Anti-Patterns

Too low tokens for proofs: 512 tokens truncates chain-of-thought
Too low temperature for proofs: 0.2 misses creative tactic paths
No proof plan: Jumping to tactics without planning reduces success rate

Source Sessions

This session: APOLLO parity - increased max_tokens 512->4096, temp 0.2->0.6
This session: Added proof plan prompt for chain-of-thought before tactics

Related Skills

verify

243K

Use when you want to validate changes before committing, or when you need to check all React contribution requirements.

facebook

Holen

test

243K

Use when you need to run tests for React core. Supports source, www, stable, and experimental channels.

facebook

Holen

feature-flags

243K

Use when feature flag tests fail, flags need updating, understanding @gate pragmas, debugging channel-specific test failures, or adding new flags to React.

facebook

Holen

extract-errors

243K

Use when adding new error messages to React, or seeing "unknown error code" warnings.

facebook

Holen

flow

243K

Use when you need to run Flow type checking, or when seeing Flow type errors in React code.

facebook

Holen

flags

243K

Use when you need to check feature flag states, compare channels, or debug why a feature behaves differently across release channels.

facebook

Holen

llm-tuning-patterns

LLM Tuning Patterns

Pattern

Theorem Proving / Formal Reasoning

Proof Plan Prompt

Parallel Sampling

Code Generation

Creative / Exploration Tasks

Anti-Patterns

Source Sessions

You Might Also Like

Related Skills

verify

test

feature-flags

extract-errors

flow

flags