10 days old

Site Reliability Engineer - Big Data Specialty

Bethesda, MD 20817
  • Job Code
job summary:

TITLE: Site Reliability Engineer - Big Data Specialty

LOCATION: Gaithersburg, MD

DURATION: 6-12 month contract with possible extension or hire

- Experience with algorithms, data structures, complexity analysis, and software design

- Experience in one or more of the following: C, C++, Java, Python, Go, Perl, or Ruby

- Interest in designing, analyzing, and engineering large-scale distributed systems

- Systematic problem-solving approach, coupled with strong communication skills

- Ability to work independently and as a member of a greater team, including cross-team activities

- Ability to debug and optimize code

- Understanding of cloud computing concepts, immutability, and pipeline automation

- Experience operating and/or operating within the software development lifecycle (SDLC)

- Undergraduate degree or equivalent experience/certification


- Experience in container operations (Docker, OpenShift Enterprise, GKE, ECS) and orchestration (Docker Swarm, Kubernetes)

- Lead DevOps maturity model through adoption

- Experience within development of the complete application stack inclusive of software engineering and systems engineering responsibilities (e.g. full-stack development)

- Subject Matter Expert understanding of the realm of Big Data and Business Intelligence (BI)

- Big Data Analytics expertise utilizing public cloud native analytics platforms (e.g. Amazon Elastic MapReduce, Google Dataproc, Microsoft Azure HDInsight)

- Requirement gathering, validation, fulfillment and change management

- Infrastructure operations experience including self-healing autonomy


Technical Leadership

- Trains and/or mentors other team members, and peers as appropriate in advances in technology and technical process improvement

- Provides financial input on department or project budgets, capital expenditures or other cost/resource estimates as requested

- Identifies opportunities to enhance the service delivery processes

- KPI creation and tracking for reporting to senior leadership

- Status reports to include project status, risks, issues and schedules

Delivering Technology

- Work closely with Enterprise Architecture to understand system functionality and dependencies

- Design cloud-based technical architectures to meet application and migration functional and non-functional big data requirements

- Run multiple, simultaneous, infrastructure delivery projects to ensure performance, availability and resiliency

- Develop infrastructure as code for repeatable environment provisioning via automated blueprints and solutions

- Ensure encryption and other Marriott Internation data security policies are met

- Concept of Operations documentation for runbook operations

- Performs more complex quantitative and qualitative analyses for service delivery processes and projects.

- Facilitates achievement of expected deliverables and obligations of Services Providers

- Validates completeness of requirements prior to Service Provider solutioning

- Ensures all projects follow the defined development and business case processes

- Ensures proper coordination with appropriate IT and vendor relations teams

- Provides consultation for routine and complex systems development

- Ensures early warning to the business stakeholder executives regarding degraded or missed service levels

- Coordinates with Operations and Infrastructure teams for deployment and production support activities

IT Governance

- Follows all defined IT standards and processes (i.e. IT Governance, SM&G, Architecture, etc.), and provides input for improvements to the appropriate process owners as needed

- Run self-governed security audits to ensure data protection

- Maintains a proper balance between business and operational risk

- Follows the defined project management standards and processes

Service Provider Management

- Validates that Service Providers develop and manage respective aspects of a project plan, including schedules, deliverables, and appropriate metrics.

- Makes short term plans for the team to effectively utilize resources

- Monitors Service Provider outcomes

- Reviews estimates of work effort for client project provided by Service Providers for accuracy

- Facilitates timely resolution of service delivery problems and minimizes the impact to clients

location: Bethesda, Maryland
job type: Contract
work hours: 9am to 5pm
education: Bachelors

The Infrastructure Development, Engineering and Automation (IDEA) Team is looking for a Senior Site Reliability Engineer to join its ranks to serve as a Big Data and DevOps convergence expert, delivering upon continuous integration and continuous delivery within the data analytics domain.

Site Reliability Engineering (SRE) is an engineering discipline which combines both software and systems engineering to build and operate large-scale, massively distributed, fault tolerant systems. Our SREs ensure that our services have the necessary resiliency and up time appropriate to user needs while incorporating a fast rate of change around functional and non-functional improvements.

We strongly believes that SREs are the approach to running better production systems, creating creative engineering solutions to resolve deployment and operations problems across the enterprise, through all major and minor project and product releases.


- 5+ years professional experience with big data analytics platforms and systems

- 5+ years professional experience with distributed processing frameworks (Hadoop)

- 5+ years professional experience with cloud computing technology and its concepts (AWS, Azure, GCP)

- 3+ years professional experience with Database Administration

- 3+ years experience with business intelligence and BI tools (Power BI, Tableau)

- 1+ years experience in configuration management tools (Chef, Puppet, Ansible) or infrastructure engineering

- 1+ year implementing DevOps practices at scale

skills: Key skills:

- Horton Works Data Platform


- Big Data Tools - Apache Spark, Hadoop

Equal Opportunity Employer: Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.


Posted: 2018-10-08 Expires: 2018-11-05

Featured Jobs

Career News

Before you go...

Our free job seeker tools include alerts for new jobs, saving your favorites, optimized job matching, and more! Just enter your email below.

Share this job:

Site Reliability Engineer - Big Data Specialty

Randstad Technologies
Bethesda, MD 20817

Join us to start saving your Favorite Jobs!

Sign In Create Account
Powered ByCareerCast