merryHunter/url-topic-mining

URL topic detection

Given a set of URLs that mobile users have visited (record format: [latitude, longitude, url1|url2...]), each URL is associated with the location of the user who requested it. We take the text of the pages that the URLs point to and analyze it to identify topics (a topic is a vector of words that appear together very often). The goal is to find the topics in different regions of the map.
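To make the notion of "words that appear together very often" concrete, here is a minimal sketch that counts word pairs co-occurring in the same page text. It is only an illustration: the function name `frequent_word_pairs`, the `min_support` threshold, and the whitespace tokenization are assumptions, and the project's actual topic-extraction method (see report.pdf) may be different.

```python
from collections import Counter
from itertools import combinations
from typing import Iterable, List, Tuple


def frequent_word_pairs(
    documents: Iterable[str], min_support: int
) -> List[Tuple[Tuple[str, str], int]]:
    """Return word pairs that co-occur in at least `min_support` documents.

    Hypothetical helper: a simple pair count, not the method of the report.
    """
    counts: Counter = Counter()
    for text in documents:
        # Count each pair at most once per document, in a canonical order.
        words = sorted(set(text.lower().split()))
        counts.update(combinations(words, 2))
    return [(pair, n) for pair, n in counts.most_common() if n >= min_support]


if __name__ == "__main__":
    pages = ["football match goal", "football goal", "recipe pasta"]
    print(frequent_word_pairs(pages, min_support=2))
```

Pairs are only the smallest possible "vector of words"; larger word sets would need a frequent-itemset or topic-model approach.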

Input:

  • A set U of pairs [Location, URLs]
  • An area A of the map, determined by a top-left and a bottom-right point
  • A step S (e.g. 2km)
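As a sketch of how one input record could be turned into a [Location, URLs] pair, the snippet below parses a single line. It assumes the fields are comma-separated, that the surrounding brackets are optional, and that URLs are separated by `|`; the names `Visit` and `parse_record` are hypothetical and not taken from the repository.

```python
from typing import List, NamedTuple


class Visit(NamedTuple):
    latitude: float
    longitude: float
    urls: List[str]


def parse_record(line: str) -> Visit:
    """Parse one record of the assumed form 'latitude, longitude, url1|url2'."""
    body = line.strip().strip("[]")
    # Split only on the first two commas, since URLs may contain commas.
    lat_text, lon_text, url_text = body.split(",", 2)
    urls = [u.strip() for u in url_text.split("|") if u.strip()]
    return Visit(float(lat_text), float(lon_text), urls)


if __name__ == "__main__":
    print(parse_record("[45.07, 7.68, http://a.example/x|http://b.example/y]"))
```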

Output:

  • A grid obtained by dividing the area A into squares of size S x S, with the popular topics identified and shown for each square.
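One way to assign a user location to a grid square is sketched below. It assumes locations are (latitude, longitude) in degrees, uses a flat-earth approximation (about 111.32 km per degree of latitude, with longitude scaled by the cosine of the area's middle latitude), and does not handle areas crossing the antimeridian. The function name `cell_index` and its parameters are hypothetical.

```python
import math
from typing import Optional, Tuple

KM_PER_DEGREE_LAT = 111.32  # approximate length of one degree of latitude


def cell_index(
    lat: float,
    lon: float,
    top_left: Tuple[float, float],
    bottom_right: Tuple[float, float],
    step_km: float,
) -> Optional[Tuple[int, int]]:
    """Return (row, column) of the S x S square containing the point,
    or None if the point lies outside the area A."""
    top, left = top_left
    bottom, right = bottom_right
    if not (bottom <= lat <= top and left <= lon <= right):
        return None
    mid_lat = (top + bottom) / 2
    km_per_degree_lon = KM_PER_DEGREE_LAT * math.cos(math.radians(mid_lat))
    # Rows grow southwards from the top edge, columns eastwards from the left edge.
    south_km = (top - lat) * KM_PER_DEGREE_LAT
    east_km = (lon - left) * km_per_degree_lon
    return int(south_km // step_km), int(east_km // step_km)


if __name__ == "__main__":
    print(cell_index(0.5, 0.5, (1.0, 0.0), (0.0, 1.0), step_km=2.0))
```

Grouping the page texts by the returned (row, column) key then gives the set of documents from which the topics of each square can be computed.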

See report.pdf.

About

Final project for Data Mining & Big Data course
