A collection of Python utilities for genomic data analysis, focusing on Hi-C, single-cell, and sequence data processing.
- translocation_hic.py: Detect potential chromosomal translocations using cooler format Hi-C data
- domain_caller.py: Call megabase-sized domains from bedGraph files
- juicer_2_tadbit.py: Convert Juicer dense matrices to TADbit format
- matrix.py: Convert TADbit matrices to BUTLRTools format
- singlecell_DamID.py: Process scDamID data (Kind et al., 2015) into bedGraph format
- basemods_separator.py: Parse PacBio base modifications (m6A, m4C) from GFF files
- bismark_chr_cov.py: Add chromosome prefix to Bismark coverage files
genomics-tools/
├── projects/
│ ├── hic_analysis/
│ ├── scdamid_analysis/
│ └── sequence_processing/
├── src/
│ ├── formats/
│ ├── analysis/
│ └── utils/
├── tests/
└── docs/
git clone https://github.com/yourusername/genomics-tools.git
cd genomics-tools
- numpy
- pandas
- cooler
- TADbit
- BUTLRTools
python src/analysis/translocation_hic.py input.cool
python src/formats/juicer_2_tadbit.py -i input.txt -o output.txt
python src/analysis/singlecell_DamID.py input_data.txt output.bedgraph
Detailed documentation for each project can be found in their respective directories:
Contributions are welcome! Please read our contributing guidelines before submitting pull requests.
This project is licensed under the MIT License - see the LICENSE file for details.
If you use these tools in your research, please cite.
- GitHub Issues: For bug reports and feature requests