Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

🤖[info] text2geoql and text2geoql-dataset #471

Open
yuiseki opened this issue May 5, 2024 · 0 comments
Open

🤖[info] text2geoql and text2geoql-dataset #471

yuiseki opened this issue May 5, 2024 · 0 comments

Comments

@yuiseki
Copy link
Member

yuiseki commented May 5, 2024

text2geoql

  • I have defined a natural language processing task named text2geoql and am in the process of building a dataset for it
  • text2geoql is a task that translates arbitrary natural language into reasonable geoql based on the intent
  • geoql is an abbreviation for "Geospatial data query languages"
    • Off course, geoql contains overpassql

text2geoql-dataset

  • https://github.com/yuiseki/text2geoql-dataset
    • This repository publishes over 1000 Overpass QLs that are paired with the TRIDENT intermediate language
    • These Overpass QLs, except for the original 100 Overpass QLs, were automatically generated by TinyDolphin, an very tiny LLM fine-tuned from TinyLlama
    • These Overpass QLs have been verified to send actual requests to the Overpass API and obtain correct results
  • This dataset is may be the first ever synthetic dataset generated by LLM in the field of GIS

Related:

@yuiseki yuiseki self-assigned this May 5, 2024
@hfu hfu changed the title [info] text2geoql and text2geoql-dataset 🤖[info] text2geoql and text2geoql-dataset May 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants