Skip to content
Snippets Groups Projects

Repository graph

You can move around the graph by using the arrow keys.
Select Git revision
  • master default
1 result
Created with Raphaël 2.2.019Mar181519Feb1813gitignoremastermastergitignoreCleaning all the useless files + gitignoreEfficient Methodclearing files for Qacleaning tempory filesCleaning directoriesAll pairs preprocessing AND change formatting of preprocessing (we add key)All changes we take into account capital lettersTaking into acount capital letters after Julien's advicePreprocessingon the whole pg100.txtClearing of Unique_words, not neededPreprocessing test on pg100_test (5 lines with 1 empty)resuming preprocessingDelete WordCount$Reduce.classDelete WordCount$Map.classchange structuration remove previous outputstart ass2Q4 invertedIndex with frequenciesQ3 count unique words...Inverted Index without frequencies- Question2Q1.iv 50 reduceurs + 1 combiners and compression map outputQ1.iii with BZip2 compression and 10 reducers, Snappy or Gzip are not working...Merge branch 'master' of https://gitlab.my.ecp.fr/2014meftahm/bpadata redundanceDelete pg100.txt, pg3200.txt, pg31100.txtQa.i et Qa.ii, there is a problem with the combiner we dont retrieve the number of stopwords... this is weirdreorganisation folder treefull projectMerge branch 'master' of gitlab.my.ecp.fr:2014meftahm/bpadata filesAdd new file
Loading