Skip to content
This repository was archived by the owner on Jan 12, 2019. It is now read-only.
/ kcna_scraper Public archive

A KCNA.jp scraper for a friends master thesis on Korean propaganda

Notifications You must be signed in to change notification settings

codebykyle/kcna_scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

What is this

This project is dedicated to scraping the North Korean http://kcna.co.jp website for analysis for a friends masters thesis. The KCNA website requires a Japanese IP in order to view its content.

#Installing

Linux and Mac

npm install

Windows

npm install --no-bin-links

Running

Go to the directory you extracted the project to:

cd ~/kcna_scraper

From that directory, run the index file of src

node /src/index.js [dates|content|body|all|help]

Dates

Dates will go through the calendar listing and pull all available dates were content is reported to have been published

Content

Content will go through all dates discovered and find available articles

Body

Body will go through all available articles and parse the contents

All

All will run everything in order

Examining data

Data is stored in the ./cache directory.

About

A KCNA.jp scraper for a friends master thesis on Korean propaganda

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published