Commit 34af58cc authored by Meiqi Guo

Update Report.md

parent b2d824e1
Branches master
@@ -80,7 +80,7 @@ Collections.sort(wordList, new Comparator<String>() {
 You can see the output file [here](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment2/blob/master/output/output_preprocess).
-All the details are written in my code [Preprocess.java](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment2/blob/master/output/Preprocess.java).
+All the details are written in my code [Preprocess.java](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment2/blob/master/Preprocess.java).
 ## Set-similarity joins
 > You are asked to efficiently identify all pairs of documents (d1, d2) that are similar (sim(d1, d2) >= t), given a similarity function sim and a similarity threshold t. Specifically, assume that:
......