De novo genome assembly using De Bruijn graph algorithms.
- De Bruijn Graphs: Graph-based representation of sequencing reads
- K-mers: Sequence fragmentation and overlap detection
- Contig Assembly: Building contiguous sequences from short reads
| File | Description |
|---|---|
de_bruijn_assembly.ipynb |
Main assembly notebook |
data/ |
Input sequencing reads and outputs |
The De Bruijn graph approach:
- Fragment reads into k-mers
- Build a graph where nodes are (k-1)-mers
- Edges represent k-mers connecting nodes
- Find Eulerian paths to reconstruct sequences