Skip to content
Snippets Groups Projects
Commit 388a85d4 authored by Meiqi Guo's avatar Meiqi Guo
Browse files

Update README.md

parent 211f63e1
No related branches found
No related tags found
No related merge requests found
# Big Data Process Assignment 2
I first try this
\ No newline at end of file
## Pre-processing the input
For the part of pre-procesing, the input consists of:
* the document corpus of [pg100.txt](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment2/blob/master/input/pg100.txt)
* the [Stopword file](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment2/blob/master/input/Stopwords) which I made in the assignment 1
* the [Words with frequency file](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment2/blob/master/input/wordfreq) of pg100.txt that I obtained by runnning the assignment
1 with a slight changement of [MyWordCount](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment1/blob/master/MyWordCount.java).
All the details are written in my code Process.java.
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment