Hello!
Thank you for the excellent work on pLM-BLAST!
I'm currently testing pLM-BLAST using the same protein FASTA file as both the query and the database. This file contains approximately 20,355 sequences.
Commands used:
python embeddings.py start a.fasta pLM-blastDB -embedder pt --gpu -bs 0 --asdir
python embeddings.py start a.fasta a.pt --gpu
python scripts/plmblast.py pLM-blastDB a pLM-blast_hits.csv
After running these commands, I got the output file pLM-blast_hits.csv. However, it contains only five columns: the queryid values are plain numbers, and there are no columns for target ID, score, identity, or similarity.
It seems that I have lost quite a lot of information.
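For reference, this is roughly how the header of the output file can be checked. It is only a minimal sketch: the helper names and the expected column names in the usage comment are my own assumptions, not taken from the pLM-BLAST documentation, and the delimiter is guessed because I am not sure which one the file uses.

```python
import csv


def read_header(path, delimiters=(",", ";", "\t")):
    """Return the header fields, keeping the delimiter that splits the first line widest."""
    with open(path, newline="") as handle:
        first_line = handle.readline().rstrip("\r\n")
    best = [first_line]
    for delim in delimiters:
        fields = next(csv.reader([first_line], delimiter=delim), [])
        if len(fields) > len(best):
            best = fields
    return [field.strip() for field in best]


def missing_columns(path, expected):
    """Return the expected column names (case-insensitive) absent from the header."""
    header = {name.lower() for name in read_header(path)}
    return [name for name in expected if name.lower() not in header]


# Usage (column names here are hypothetical placeholders):
# print(read_header("pLM-blast_hits.csv"))
# print(missing_columns("pLM-blast_hits.csv", ["queryid", "targetid", "score"]))
```

Running the two commented lines against pLM-blast_hits.csv prints the columns that are present and the ones from the list that are absent.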
See the following screenshot of my output:

Could you help me figure it out? Thanks a lot.