2 days old
2017-12-142018-01-11
  • Job Code
    589421
  • Payrate
    $40 To $47

Data Engineering:


-Cleanse, manipulate and analyze large datasets (Structured and Unstructured data - XMLs, JSONs, PDFs) using Hadoop platform.


-Develop Python, Spark, HIVE scripts to filter/map/aggregate data. Scoop to transfer data to and from Hadoop.


-Manage and implement data processes including Data Quality scripts


-Analysis and Modeling:


-Perform R&D and exploratory analysis using statistical techniques and machine learning clustering methods to understand data.


-Develop data profiling, deduping logic, matching logic for analysis


-Big Data languages such a


-5+ years of experience in processing large volumes and variety of data (Structured and unstructured data, writing code for parallel processing, XMLS, JSONs, PDFs)


-3+ years of programming experience in at least 2 - Python, Spark, Java for data processing and analysis.


-Strong SQL experience


-2+ years of experience - using Hadoop platform and performing analysis.


Familiarity with Hadoop cluster environment and configurations for resource management for analysis works Python, Spark, HIVE for analytics and developing dashboards

Categories

  • Information Technology

Randstad utilizes a technology-driven focus with a human touch to provide better staffing and business solutions to organizations around the world. Our team of experts match professionals with available career opportunities in a variety of fields.

Featured Jobs

Career News

Before you go...

Our free job seeker tools include alerts for new jobs, saving your favorites, optimized job matching, and more! Just enter your email below.

Share this job:

Data Analyst

Randstad Technologies
McLean, VA 22102

Share this job

Data Analyst

Randstad Technologies
McLean, VA
US

Separate email addresses with commas

Enter valid email address for sender.

Join us to start saving your Favorite Jobs!

Sign In Create Account
Powered ByCareerCast