
Training
Summer Training on Apache Hadoop from Naresh i Technologies, Hyderabad
Data Analyst and Programmer
HBTI Kanpur
I am a final-year student at Harcourt Butler Technical University (a Tier-II institute founded in 1920) in Kanpur, Uttar Pradesh. I am pursuing a Master of Computer Applications.
I am a simple, friendly, and ambitious person. I love to swim, eat dishes that I cook myself, and hang out with friends. Technology and aesthetics interest me a lot. I also love to travel, discover new places, and meet new people.
Recently, I earned the Big Data Spark Foundation badge issued by IBM. I love playing with data. I have worked on projects that involved analysing huge datasets to find solutions to challenging research problems.
Here is the link to my Resume.
Below are some of the major projects I have worked on.
Summer Training on Apache Hadoop from Naresh i Technologies, Hyderabad
Processed an XML file of user reviews, around 2 GB in size: a MapReduce job (matching a list of bad words to find negative reviews) converts the unstructured data to structured data, Pig and Hive filter the data, and Sqoop loads the results back into an RDBMS (MySQL).
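The bad-word matching step can be illustrated with a minimal sketch in plain Scala collections. This is not the original MapReduce job: the word list, the object name `NegativeReviewFilter`, and the tokenisation rule are all assumptions made for illustration, and the reviews are assumed to be already extracted from the XML as plain strings.

```scala
object NegativeReviewFilter {
  // Hypothetical word list; the list used in the project is not given in the text.
  val badWords: Set[String] = Set("bad", "terrible", "awful")

  // "Map" step: count how many tokens of a review appear in the bad-word list.
  def badWordHits(review: String): Int =
    review.toLowerCase.split("\\W+").count(badWords.contains)

  // "Filter" step: keep only the reviews with at least one bad-word hit.
  def negativeReviews(reviews: Seq[String]): Seq[String] =
    reviews.filter(review => badWordHits(review) > 0)
}
```

A plain word match like this also flags phrases such as "not bad", so a real job would likely need a more careful rule.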
Computed the daily revenue and number of orders from the order items table using Sqoop, Hadoop, and Apache Spark.
Imported all the tables from MySQL into HDFS using Sqoop.
Joined the two datasets (tables).
Performed operations on the RDDs and DataFrames to obtain the desired results.
Language: Scala, SQL
Tools used: Hadoop ecosystem and Spark framework
OS: CentOS
Hadoop version: 2.2.1
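The join-then-aggregate logic of the daily revenue project can be sketched in plain Scala collections, which mirror the shape of the RDD operations without needing a cluster. This is not the original Spark code: the case classes, field names, and the `summarize` function are hypothetical, and the second table is assumed to be an orders table that carries the order date.

```scala
object DailyRevenue {
  // Hypothetical schemas; the real table columns are not given in the text.
  final case class Order(orderId: Int, orderDate: String)
  final case class OrderItem(orderId: Int, subtotal: BigDecimal)
  final case class DailySummary(orderDate: String, orderCount: Int, revenue: BigDecimal)

  def summarize(orders: Seq[Order], items: Seq[OrderItem]): Seq[DailySummary] = {
    // Lookup table from order id to order date (the "orders" side of the join).
    val dateByOrder: Map[Int, String] =
      orders.map(o => o.orderId -> o.orderDate).toMap

    // Inner join: items whose order id has no matching order are dropped.
    val joined: Seq[(String, OrderItem)] =
      items.flatMap(item => dateByOrder.get(item.orderId).map(date => date -> item))

    // Group by date, then count distinct orders and sum the item subtotals.
    joined
      .groupBy { case (date, _) => date }
      .toSeq
      .map { case (date, rows) =>
        val dayItems = rows.map { case (_, item) => item }
        DailySummary(date, dayItems.map(_.orderId).distinct.size, dayItems.map(_.subtotal).sum)
      }
      .sortBy(_.orderDate)
  }
}
```

On a Spark RDD the same steps would typically be expressed with a keyed `join` followed by a per-key aggregation; the collection version above only illustrates the logic.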