Not able to find Data set on which code has to be run #3

Open
suyashaoc opened this issue Jul 19, 2016 · 2 comments

Comments

@suyashaoc

Please provide a link where I can get the data needed to run the code successfully.

@adamjshook
Owner

You can download the data set from the Internet Archive: https://archive.org/details/stackexchange

Note that the data has been updated since the book was written, so you may run into some data-related errors when running the code.

@badalrocks

Hi Adam,

First, thanks for writing such a wonderful book, MapReduce Design Patterns. I am using it for hands-on practice with MapReduce programs.

I have the same issue getting the data set. For example, I am on the Hierarchical design pattern and looking for two flat files: posts.txt and comments.txt. I am not sure where to find these files at the link you posted above: there are 333 zip files.

Would it be feasible for you to post the input data set for running the code under:
mapreducepatterns/MRDP/src/main/resources/

Thanks!
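
One possible way to produce the two flat files is sketched below, under assumptions this thread does not confirm: that each per-site archive at the Internet Archive link, once extracted, contains a Posts.xml and a Comments.xml with one `<row ... />` element per line, and that the book's examples read their input one such row per line. If that holds, posts.txt and comments.txt can be made by keeping only the row lines and dropping the XML declaration and the enclosing tags. The function name `extract_rows` and the file names in the `__main__` section are illustrative, not taken from the repository.

```python
from pathlib import Path


def extract_rows(xml_path, txt_path):
    """Copy every <row ... /> line from a dump XML file into a flat text file.

    Returns the number of rows written.
    """
    count = 0
    with open(xml_path, encoding="utf-8") as src, \
            open(txt_path, "w", encoding="utf-8") as dst:
        for line in src:
            stripped = line.strip()
            # Assumption: one <row ... /> element per line. Everything else
            # (XML declaration, enclosing open/close tags) is skipped.
            if stripped.startswith("<row "):
                dst.write(stripped + "\n")
                count += 1
    return count


if __name__ == "__main__":
    # Hypothetical file names: adjust to wherever the archive was extracted.
    for source, target in [("Posts.xml", "posts.txt"),
                           ("Comments.xml", "comments.txt")]:
        if Path(source).exists():
            print(target, extract_rows(source, target))
```

As the earlier reply notes, the data has been updated since the book was written, so rows may carry attributes the book's code does not expect; this sketch copies rows verbatim and does not try to reconcile such differences.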
