LoRA Training

Frank's personality lives in a LoRA (Low-Rank Adaptation) adapter — a small set of weight modifications applied to the base Qwen 2.5 3B model at load time.

Current Version: v16

Metric	Value
Training examples	3,182
Eval examples	80 (0% contamination)
LoRA rank	32
Learning rate	1e-4
Epochs	2
Steps	~398
Training time	~3-4 hours (RTX 5070)
Adapter size	~115 MB (F16 GGUF)

What's In the Training Data

Category	Examples	Purpose
Safety hard-refusals	95	CSAM, DDoS, phishing, bombs, drugs, CBRN, murder
Memory recall	48	Always search before answering
GPU stock phrase negatives	50	Stop mentioning hardware unprompted
Math via shell/python	44	Solve math by running code, not guessing
Tool use (multi-tool)	47	Use 2-3 tools in a single response
Tool use (multi-turn)	22	Tool chains across conversation turns
Crisis/988	25	Redirect to 988 Suicide & Crisis Lifeline
Architecture corrections	25	23 verified hallucinations corrected
Identity	15	Consistent "I'm Frank" under pressure
+ 10 more categories	~150	Various improvements

The IAPT Cycle

Each LoRA version is developed through IAPT Training Method:

Deploy current LoRA
Evaluate with 6 adversarial personas (180-200 turns)
Analyze — document every failure
Generate training examples that fix the failures
Merge with existing training data, deduplicate, quality-check
Train on remote GPU
Deploy new version → repeat

v12 scored 1/10 on safety. v14 scored 9/10. v16 targets 10/10 on all safety categories.

Zero Inference Overhead

llama.cpp merges LoRA weights into the base model at load time. The adapter adds exactly 0ms to inference. Speed is identical with or without LoRA.

What Survives a Model Swap

The LoRA adapter is model-dependent — it only works with Qwen 2.5 3B. But Frank's state (personality, memories, mood, self-knowledge) lives in 25 databases. Swap the model, and personality survives. The LoRA just teaches the new model to express that personality consistently.

Current Version: v16

What's In the Training Data

The IAPT Cycle

Zero Inference Overhead

What Survives a Model Swap

MORE IN PERSONALITY & IDENTITY