Validation System Quick Reference
Quick Commands
# Development workflow
python scripts/dev_validate.py --quick # Quick validation (5-10s)
python scripts/dev_validate.py --status # Show current status
python scripts/dev_validate.py --compare # Compare with previous
# Before commit
python scripts/dev_validate.py --full # Full validation (15-30s)
# Component testing
python scripts/dev_validate.py --component prompts # Test prompts
python scripts/dev_validate.py --component quality # Test quality system
# Advanced comparison
python scripts/validation_comparison.py --mode=trend # Show trends over time
Understanding Output Files
File |
Purpose |
When Populated |
latest_validation_summary.json |
Current test results |
Always |
reasoning_metrics.json |
Performance metrics |
Always |
improvement_patterns.json |
Pattern analysis |
After ≥10 runs |
learning_insights.json |
Learning insights |
After ≥25 runs |
Key Metrics to Watch
Metric |
Target |
Alert Level |
Current |
Success Rate |
100% |
<100% |
100% PASS |
Quality Score |
≥85% |
<80% |
88.5% PASS |
Execution Time |
≤30s |
>45s |
25.0s PASS |
Biological Accuracy |
≥90% |
<85% |
92% PASS |
Empty Files (Normal Behavior)
improvement_patterns.json
: Empty until 10+ validation runs
learning_insights.json
: Empty until 25+ validation runs
This is expected behavior - the system needs enough data to identify patterns and generate insights.
Troubleshooting
Issue |
Solution |
"Components not available" |
pip install -r requirements.txt |
No result files |
Run validation first: python scripts/dev_validate.py --quick |
Test failures |
Check specific error in latest_validation_summary.json |
Performance regression |
Compare trends: python scripts/validation_comparison.py --mode=trend |
Development Workflow
- Make changes (code, prompts, config)
- Quick validation:
python scripts/dev_validate.py --quick
- Check status: Look for PASS or FAIL in output
- Compare changes:
python scripts/dev_validate.py --compare
- Before commit:
python scripts/dev_validate.py --full
Available Components
prompts
- Prompt management system
context
- Context enhancement
quality
- Quality validation
intelligence
- Artifact intelligence
integration
- Cross-component integration