Large scale neural data is becoming increasingly available, but it can be difficult to find the right dataset for a given question. This project is a semantic search interface for exploring neuroscience datasets from OpenNeuro, DANDI, and BIDS using natural language queries.
- Enter search terms in the search bar to find relevant datasets
- Use the sidebar filters to narrow down results by:
- Imaging modality
- Species
- Task type
- Dataset size
- Click dataset titles to view them on their respective websites
- Expand README sections to learn more about each dataset
- marqo-python
- streamlit
- Docker (for running Marqo)
You need to have Marqo running so that the app can connect to it.
- https://github.com/marqo-ai/marqo-on-kubernetes
- or use ngrok to expose your local server to the public internet for testing, I've included a docker-compose.yml file to start both Marqo and ngrok.
docker compose up
The, you can deploy the frontend to Railway. Set the MARQO_URL environment variable to the public URL of your Marqo instance.
This project uses PDM to manage dependencies. To install the dependencies, run:
pdm install
To run the dataset downloaders, run:
python -m src.ml_on_the_mind.download.openneuro_downloader
python -m src.ml_on_the_mind.download.dandi_downloader
To run the database, run:
docker rm -f marqo
docker pull marqoai/marqo:latest
docker run --name marqo -it -p 8882:8882 marqoai/marqo:latest
To load the data into the database, run:
python -m src.ml_on_the_mind.build_vector_db
To run the app, run:
pdm run streamlit run src/ml_on_the_mind/app.py