@@ -82,3 +82,7 @@ You can see the output file [here](https://gitlab.my.ecp.fr/2014guom/BigDataProc
...
@@ -82,3 +82,7 @@ You can see the output file [here](https://gitlab.my.ecp.fr/2014guom/BigDataProc
All the details are written in my code [Preprocess.java](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment2/blob/master/output/Preprocess.java).
All the details are written in my code [Preprocess.java](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment2/blob/master/output/Preprocess.java).
## Set-similarity joins
For this part, I can't use directly 'pg100.txt' with 12