diff --git a/Report.md b/Report.md
index b11226d192f58fafd898b1434a5c6c2f27c4c96e..3db4d14ad60e83939550d3cb05e7af84586b26a0 100644
--- a/Report.md
+++ b/Report.md
@@ -7,7 +7,7 @@ For the part of pre-procesing, the input consists of:
 
 * the [Words with frequency file](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment2/blob/master/input/wordfreq) of pg100.txt that I obtained by runnning the assignment 1 with a slight changement of [MyWordCount.java](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment1/blob/master/MyWordCount.java).
 
-I do the following tasks in [Preprocess.java](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment2/blob/master/output/Preprocess.java):
+I do the following tasks in [Preprocess.java](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment2/blob/master/Preprocess.java):
 
 **STEP 1: Remove all stopwords**
 