Process Flow
Detailed description of each process in the CheckRef pipeline.
Process Execution Order
- VALIDATE_VCF_FILES - Parallel (one per VCF)
- CHECK_ALLELE_SWITCH - Parallel (one per chromosome)
- REMOVE_SWITCHED_SITES or CORRECT_SWITCHED_SITES - Parallel
- VERIFY_CORRECTIONS - Parallel
- CREATE_SUMMARY - Single process (aggregates all)
Parallelization
CheckRef automatically parallelizes across chromosomes:
- 22 autosomes + X, Y, MT can run simultaneously
- Each chromosome is independent
- No cross-chromosome dependencies
- Linear scaling with available resources
Error Handling
- Failed VCF validation → Skip chromosome
- Build mismatch detected → Graceful exit with message
- Process failure → Can resume with
-resume - Verification failure → Reported in logs
See Also
- Modules - Process details
- Resource Usage - CPU/memory needs
