ChromoSort Documentation

ChromoSort is a reference-guided assembly curation toolkit for sorting contigs, conservatively cleaning mostly-correct assemblies, splitting reviewed chimeric contigs, making manual cuts, reviewing dot plots, scaffolding ordered contigs, applying reviewed graph-supported gap fills, and plotting existing whole-genome alignments.

This documentation is organized as a user guide for installing ChromoSort, preparing inputs, running common workflows, and reading command outputs.

The most important workflow rule is that alignment evidence matches one exact FASTA pair. MUMmer coords or minimap2 PAF generated from raw.fa can support multiple decisions about raw.fa, but edited outputs such as ordered.fa, fixed.fa, manual FASTA exports, and scaffold FASTAs need fresh alignments before they are used as inputs to another alignment-dependent command.

Start Here

Commands

Running Analyses

Project Information

Main Workflow

  1. Generate MUMmer show-coords or minimap2 PAF alignments between the reference and assembly.
  2. For mostly-correct assemblies, use chromo clean for conservative sort-filter-fix cleanup, then re-align the cleaned FASTA.
  3. For more complex cases, use chromo plot and the dot-plot guide, then use chromo manual when helpful to inspect suspicious contigs.
  4. Use chromo eval and task-specific chromo manual dashboards when outlier fix, scaffold, or gapfill decisions need table or GUI review.
  5. Use chromo fix or chromo cut for reviewed sequence edits.
  6. Re-align the fixed assembly.
  7. Use chromo sort to assign, filter, orient, and order contigs.
  8. Re-align again if a later command should operate on ordered.fa rather than on the original assembly and its assignment report.
  9. Use chromo scaffold to build one scaffold per reference sequence.
  10. Use chromo gapfill only when reviewed GFA paths should replace N gaps.