cloudera certified hadoop professional

Upload: prashant-chutke

Post on 12-Apr-2017

CLOUDERA CERTIFIED HADOOP PROFESSIONAL PRASHANT CHUTKE

Prashant Ramesh Chutke
Cloudera Certified Bigdata Hadoop Professional

[email protected], +353 0899415331

Dublin, Ireland

BACKGROUND

- Over 14 years of IT experience, including major development and implementation of business applications in C/C++ and Bigdata technology.
- Over 3 years of experience working on Bigdata-related technologies.
- Cloudera Certified Apache Hadoop Professional (CCDH).
- Certified in the Bigdata Architecture program by BigDataTraining.IN.
- Certified in Accenture Management Development Academy through ISB.
- Worked over 11 years with Accenture; currently working with Amazon Web Services.
- Cross-industry development experience across FS, Telecom and H&PS.
- Proficient in understanding business processes/requirements and translating them into technical requirements, providing estimates, and identifying issues and solutions along with testing.
- Strong skills in requirement gathering, mapping, GAP analysis, recommendations for business process improvements, effort estimation and resource planning.
- Hands-on expertise in Big Data technologies like Apache Hadoop (HDFS, MapReduce), Spark, Pig, Hive, Sqoop, Flume and other parts of the Hadoop ecosystem.
- Exposure to designing Hadoop solutions under both Cloudera and Hortonworks distributions.
- Knowledge of distributed NoSQL databases like HBase and Cassandra, and SQL solutions like Impala and Spark SQL. Knowledge of Spark programming using Scala.
- Exposure to AWS bigdata services (EMR, DynamoDB, Elasticsearch, Kinesis, Data Pipeline) and to basic AWS services (S3, EC2, ELB, EBS, Auto Scaling, Route 53, RDS, CF and many more).
- Good communication skills with a proven track record of working with and managing diverse professionals, clients and teams efficiently from client locations, and hence direct exposure to working with clients on-shore.
- Exposure to project management and working in an agile/Scrum environment.

SKILL SUMMARY

- Languages known → C/C++, basic Java, Scala, VBA, SQL etc.
- Platforms → Unix/Linux/Windows, Apache Hadoop (HDFS) and AWS
- Frameworks → EMR, MapReduce and Spark (using Scala)
- Tools → Eclipse, vi editor, Visual Studio, PuTTY, Hue, WinSCP, AutoSys
- Hadoop Ecosystem → Pig, Hive, Oozie, Hue
- Data Ingestion → Sqoop, Flume, Kafka, Flafka, Kinesis
- SQL Solutions → Hive, Spark SQL, Impala
- NoSQL Solutions → HBase, Cassandra, DynamoDB
- Spark Unified Stack → Spark Streaming, Spark SQL, MLlib, GraphX
- File Formats → Avro, JSON, Parquet, SequenceFile, RC

PROJECT DETAILS

1) AMAZON WEB SERVICES (JULY 2016 – PRESENT)

Currently working with Amazon Web Services, Ireland, at level 5 as a bigdata professional. Major exposure to EMR, DynamoDB, Elasticsearch, Kinesis and Data Pipeline.

2) ACCENTURE SERVICES PVT. LTD. (FEBRUARY 2005 – MAY 2016)

I) PROJECT: Accenture Digital Solution Factory (July 2015 – May 2016)

CLIENT: RSA Insurance Group

Platform: Cassandra, Spark
S/w & Tools: DataStax Cassandra, Spark, CQL, Eclipse, WinSC

RSA Insurance Group plc (trading as RSA) is a British multinational general insurance company headquartered in London, UK. RSA has major operations in Ireland, Scandinavia and Canada and provides insurance products and services in more than 140 countries through a network of local partners. It has 17 million customers. RSA owns the More Than direct car, home, pet and travel insurance brand in the United Kingdom. More Than also sells van, business car, shops and offices and business insurance through its More Than Business operation. The project involves two-step data ingestion into Cassandra, followed by D4P calculations using Spark. Finally, the calculated data goes to the visualization layer.

ROLE: Hadoop Designer/Developer

- Cassandra data modelling using NoSQL data modelling paradigms.
- Designed XML and CSV file ingestion from the ESB layer to the Operational Data Hub (ODH).
- Designed XML and CSV file ingestion from ODH to Cassandra using Java DOM/XPath, de-serialization of files and interaction with Cassandra driver APIs to ingest the same into Cassandra.
- Real-time Data4pricing calculation using Apache Spark and Scala.
- XML to CSV conversion using DOM.
- NoSQL data modelling for Cassandra.
- Reviewing technical design documents, test plans and test scripts was also under my purview.
- Code base design, including common library design, code configuration etc.
- Attending daily/weekly technical meetings, review meetings, client status meetings etc., highlighting issues/concerns in a timely manner and seeking/providing solutions.
- Interviewing resources for the team.

II) PROJECT: WellCare PBM (February 2014 – June 2015)

CLIENT: WellCare Health Plans, Inc.

Platform: Hadoop Lake, HDFS
S/w & Tools: Apache Hive, Sqoop, Flume, Informatica BDE, Python, Hue interface, AutoSys, PuTTY, WinSCP, Hortonworks Data Platform (HDP)

WellCare Health Plans, Inc. is based in Tampa, Florida. WellCare provides Medicare and Medicaid managed care health plans for over 4 million members. WellCare Health Plans, Inc. is the holding company for several subsidiaries, including WellCare, Staywell, HealthEase, Harmony, and 'Ohana. The purpose of the project was to design a solution for loading files for claims, encounters, eligibility/COB response, PDE, invoice data, and HIX edge data. It provides a single data repository to store all pharmacy-related data. The solution was to load each raw file into staging tables (the Data Lake) in preparation for future refinement of the files before they are finalized and loaded into Greenplum (the data repository). In addition, the system tracks each file event and generates alerts based on whether items pass or fail the loading process.

ROLE: Hadoop Architect/Designer/Developer

- Was responsible for creating a PoV to solution the problem domain. This was done by identifying data processing areas and data massaging strategies, and by recommending tools for individual areas. Each area was also provided with alternate tools based on relative features. The PoV was presented to the client and was well appreciated.
- A separate solution was created to deal with CDC data.
- Also designed parameters to validate source data files and data convergence to a Hive-understandable format, preparing a best-practice document for the same.
- Data auditing, data archiving, data purging, error logging and alert systems were the support systems designed as part of data governance.
- Reviewing technical design documents, test plans, test scripts and field-level testing evidence was under my purview.
- Code base design, including common library design and code configuration design, was done during the development phase.
- Attending daily/weekly technical meetings, review meetings, client status meetings etc., highlighting issues/concerns in a timely manner and seeking/providing solutions.
- Interviewing Hadoop resources for the team.

III) PROJECT: Digital Hadoop Capability (August 2013 – January 2014)

CLIENT: Accenture Technology Group

Platform: Hadoop cluster, HDFS
S/w & Tools: Apache Hive, Pig, Sqoop, Hue interface, Oozie, PuTTY, WinSCP, Cloudera platform.

Digital Hadoop Capability is the platform for all the projects aligned to industrial account groups. The Hadoop capability provides SME support, sales support in RFP/RFI, technical resource pool building, technical trainings, asset creation etc. I was part of the capability for around 6 months, during which we developed a few PoCs on Hadoop and took some initiatives as below:

ROLE: Hadoop Lead

- Externally trained and certified on Bigdata Hadoop through BigDataTraining.IN.
- Cleared the Cloudera certification for Hadoop professionals, CCD-410.
- Active contribution in the capability forum for resolving technical queries.
- Part of the recruitment drive for hiring Hadoop resources.
- Accenture-accredited trainer for Bigdata Hadoop at Mumbai, India.
- Internalized a few industrial trainings and was part of the review team for existing training material.
- Technical SME support to projects during the proposal phase.
- Contributed to the below Proofs of Concept (PoCs):
  o Enron E-mail Analysis: e-mail data transformed and loaded into Hive.
  o Health Care Data Analysis: moved large textual data to Hive.
  o Analyzer for Web Log Data: application logs were loaded in text format.
  o Super Market Product Analysis: to find items which are usually sold together.

IV) PROJECT: Accenture Technology Growth Platform (February 2010 – July 2013)

CLIENT: Accenture TGP

The Central Capability Team performs functions for all the capabilities under Accenture delivery units. As part of this team I was involved in the following activities:

ROLE: Team Lead

- Demand management activity, including resource role-on/off, movement etc.
- Project and engagement maintenance.
- Laptop management for the capability.
- Global career path staffing and SME loaning.
- H1B visa drive support.
- WBS creation and maintenance.
- Sales operation activities.
- Recruitment activities.
- Driving the group ID maintenance process.
- Driving creation and maintenance of process documentation and automation-related activities.

V) PROJECT: Telstra (April 2008 – January 2010)

CLIENT: Telstra

Telstra, a communications company in Australia, provides customers with integrated telecommunications across fixed line, mobiles, broadband, information, transaction and search, and pay TV (FOXTEL). The following were my responsibilities:

ROLE: Team Lead

- Execute the lead responsibilities.
- Monitor and execute BC/Proforma cycles.
- Understand monthly release requirements for Billing.
- Creation of SLA documentation and training for other leads and the team.
- Develop good knowledge of applications.
- Identify performance improvements in Production tasks.
- Drive the team to create documentation.
- Escalate issues at the right time.
- Proactively identify and resolve issues.
- Better synergy with the team.

VI) PROJECT: Barclays Bank PLC UK (April 2005 – March 2008)

CLIENT: Barclays Bank

Platform: UNIX
S/w & Tools: UNIX Script, C/C++ programming, JavaScript 1.2, Visual Studio 6.0

i) Subproject: Global Liquidity Monitoring

The distributed application consists of a group of applications, namely GLAS, BFA and GLM. These applications are part of the Business Banking portfolio. GLM is the Barclays Global Liquidity Management module, used to generate and route the payment messages generated by AFTS/BFA/SOLD. It is also used to monitor the liquidity position across 3 groups of European countries.

ii) Subproject: GCIS2 Distributed Application

A distributed application for Barclays Bank called GCIS2 (Global Credit Information System Version 2). GCIS2 is a global exposure monitoring and reporting system, recording credit information from the majority of large corporate and banking relationships across the group. Credit Preparation and Sanctioning, Risk Monitoring, Interfaces and the Non-Functional Area (which includes the security areas and details on access permissions) form the basic functional areas of GCIS2.

ROLE: Team Lead

I was involved in providing:

- Managerial/technical support to the GLM application.
- Managing the Distributed Apps team.
- Support to the interfaces area, in the disciplines of incident, problem and change settlement.
- Creating/maintaining middle-tier components and utilities developed as C++ undertakings.
- Server-side coding and debugging.
- Unit/integration testing.
- Assessment and estimation of new interfaces.
- Identifying and escalating any connectivity-related issues in Distributed Applications.
- Allocating and tracking tasks.
- Ensuring optimal resource utilization.

3) PEROT SYSTEMS TSI (INDIA) LTD (DECEMBER 2002 – JANUARY 2005)

PROJECT: IBA Unicare Support (January 2003 – January 2005)

CLIENT: Torex Medical Systems Limited

Platform: Linux
S/w & Tools: Apache, JavaScript 1.2, C/C++ programming, remote scripting

i) Subproject: Data Take Off

The project was related to the transfer of data from a database to ASCII text files and vice versa. Using the file specifications, data from iSOFT’s legacy AS (Administration System) is transferred into iSOFT iPM as a series of ASCII text files. The file formats are intended to represent the data required to load into and operate the secondary administration system. They are designed to be generic in nature, such that the data can be provided in this format in a relatively seamless manner from almost any existing secondary care AS.

ii) Subproject: Audit Centre

“Audit Centre” was one of the biggest projects I have worked on. The project had three phases and, as the name suggests, was built for auditing and monitoring purposes. I was an active developer and tester for all three phases. The first phase spanned three months; the second and third phases were in operation at the time. I was responsible for developing modules like User Auditing, audit data presentation, user search screen, reporting, simulating test data etc.

iii) Subproject: Bed Management Centre

Bed Management Centre is one of the health care products of Torex. It is fully developed in a language called TBL (Transportable Business Logic), which is a server-side scripting language, and in C/C++ programming for character-based applications. It is developed using advanced techniques like scheduling, batch programming, remote scripting, swapping logic, a statistics re-calculator, a dynamic front-end manager, a database flushing manager and many more.

ROLE: Developer

- Development of JavaScript and HTML based web screens.
- Server-side programming.
- Unit and integration testing.
- Database creation.
- Involved in client interaction, such as raising issues and getting them resolved.
- Delivering the project artifacts, including traceability matrix/test log.
- Code review/test script review.

4) SUPERTECH INFOWARE PVT. LTD. (JANUARY 2002 - NOVEMBER 2002)

CLIENT: Supertech Infoware Ltd. / Chitra Scan and Imaging Centre, Mumbai, India.

Platform: Windows NT
S/w & Tools: FrontPage 2000, IIS, MS Access 2002, ASP, C/C++, JavaScript, Visual Basic 6.0, Data Reports.

i) Subproject: IndiaVoip

www.indiavoip.com is a one-stop place for VoIP (Voice over Internet Protocol) related information. It is a database-driven site. The site provides broad coverage of the VoIP industry both in India and globally. It features news, research, market trends, companies, vendors, products, services and others. It acts as a medium for developing a community of VoIP professionals, amateurs and the general public.

ii) Subproject: Chitra Scan

The clinic management system is for managing the day-to-day operations of the clinic. The system includes Patient Information Master, Patient Visit Master, Referral Doctor Master, Professional Doctor Master, Service Master, Inventory Control Master, Accounts, Reports, Bill Generation, generating diagnosis files, Back Up and others. The modules were interlinked and provided efficient automation.

iii) Subproject: Shopping Complex

Inter-fone is a Mumbai, India based company dealing with VoIP technology; they have more than four branches in Mumbai and one in the USA. They are also wholesalers of NAT routers, VoIP gateways, interfones and many more things in the VoIP stream. Supertech developed a dynamic website which includes a shopping complex as one of the major components. The whole site was created using Active Server Pages and MS Access.

ROLE: Designer, Developer, Tester

- Database design and component repository.
- Designed and developed the discussion forum and opinion poll.
- Developed registration forms.
- Involved in the design and development of the admin module.
- Was involved in development and component repository.
- Also involved in debugging the individual software modules.
- Was responsible for unit and black-box testing.
- Payment gateway interfacing.

EDUCATION

- Diploma in Electrical Engineering from the Board of Technical Examination, Mumbai, India, with specialization in Switchgear & Protection, Electrical m/c Design, Industrial Electronics and Power System Engineering.
- Bachelor’s Degree in Electrical Engineering from V.J.T.I., Matunga (University of Mumbai, India), with specialization in Computer Application in Power System and Project Management.

PROFESSIONAL TRAINING AND CONTINUING EDUCATION

- Diploma in Advanced Computing from C-DAC, MET-IIT Bandra, Mumbai, India.

CERTIFICATION

- Cloudera Certified Developer for Apache Hadoop (CCDH)
- Bigdata Architect by BigDataTraining.IN

PART OF PROFESSIONAL SOCIETIES

- apache.hadoop.org
- Hortonworks University
- Cloudera Engineering Blog

TITLE OF DISSERTATION

- Project “Multi-Tester”.
- Project “Design & Fabrication of Permanent Magnet Brushless D.C. Motor”.

PRIZES

- 1st Prize in “Dipex-1995” (Maharashtra Level) for Project “Multi-Tester”.
- 3rd Prize of Rs 10,000/- in Elecrama-1996 Exhibition (International Level) for Project “Multi-Tester”.
- Stood 2nd in Diploma Board (Board of Technical Examination, Mumbai, India).

PASSPORT (INDIAN)

- M6852233 – Valid till 2025.

HOBBIES

- Reading, Singing, Swimming.

Place: Dublin, Ireland.    Mr. Prashant R. Chutke.

Date: _____________    Contact No. (M): +353 0899415331