diff --git a/README.md b/README.md
index 5d771856b507121c3917fcb0d724851f3e5b35e1..0ed83fe12651e9981d530432e3c9d9ab5fda22ce 100644
--- a/README.md
+++ b/README.md
@@ -1,2 +1,13 @@
 # Big Data Process Assignment 2 
-I first try this
\ No newline at end of file
+## Pre-processing the input
+For the part of pre-procesing, the input consists of:
+* the document corpus of [pg100.txt](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment2/blob/master/input/pg100.txt)
+* the [Stopword file](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment2/blob/master/input/Stopwords) which I made in the assignment 1
+* the [Words with frequency file](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment2/blob/master/input/wordfreq) of pg100.txt that I obtained by runnning the assignment
+1 with a slight changement of [MyWordCount](https://gitlab.my.ecp.fr/2014guom/BigDataProcessAssignment1/blob/master/MyWordCount.java).
+
+
+
+
+All the details are written in my code Process.java.
+