6 days old

Senior Site Reliability Engineer (SRE)

Multiple Cities, Multiple
  • Job Code

As anIBM Application Architect, you directlyhelp clients transformtheir businessandsolve complex problems. You will define the scope and vision for projects that deliver customized solutions using your knowledge of IBM platforms. You are a technical leader, serving as a liaison among business partners, technical resources, and project stakeholders.

Your Role and Responsibilities
The Global Business Services Cloud Center of Competency is a leader in Cloud solutions. We are seeking a Senior Site Reliability Engineer (SRE) to join our team.

Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that companys servicesboth internally critical and externally-visible systemshave reliability and uptime appropriate to users' needs and a fast rate of improvement while keeping an ever-watchful eye on capacity and performance.
SRE is a software engineer who knows how to apply engineering principles to operations. You have demonstrable experience managing or developing a multi-tenanted & multi-cloud solutions. You are well versed in a large number of technologies and welcome new tools and techniques. You work in conjunction with fellow developers and operations members to come to the best possible solution. You are always looking for patterns and ways to increase efficiency, eliminate downtime, optimize costs, and maintain performance at scale. You will also advise our clients on SRE value proposition, adoption, industry best practices, and implementation strategy.
In this role, you will work as the lead member of the Site Reliability team with the following key responsibilities:
  • Work with the development teams to design scalable, robust systems using cloud architecture.
  • Build automation using industry tools (like Jenkins, Ansible, etc.) to deploy hundreds of different services.
  • Ensure a high degree of availability across all of our service offerings.
  • Be proficient in one or more cloud providers, including IBM Cloud, RedHat OpenShift, AWS, Azure, GCP.
  • Have experience writing applications using Java, C#, Python, or JavaScript.
  • Identify bottlenecks and problems throughout the infrastructure.
  • Prefer to build automation to perform redundant tasks rather than manually handling toil.
  • Enjoy pushing scalability to the limit with high throughput services.
  • Design solutions with failure in mind to ensure reliability.
  • Enjoy working with a large variety of services and technologies.
  • Like looking through metrics and logs as if it were a treasure hunt.
  • Avoid logging into servers directly and prefer using automation and aggregation to manage them.
  • Strive to be a responsible enabler rather than a "gate".

This position requires up to 100% Global Travel. You must be willing to travel.

Required Technical and Professional Expertise
  • 12+ years experience as with systems and/or software engineering.
  • 5+ years experience with software development.
  • 5+ years experience with systems engineering.
  • 5+ years experience troubleshooting software.
  • 2+ years experience leading a team.
  • Experience in a DevOps environment.

Preferred Technical and Professional Expertise
  • Strong experience with Git or equivalent source code repository.
  • Experience with OpenStack or similar proprietary cloud like IBM Cloud, OpenShift, AWS, Azure, GCP.
  • Experience with CI/CD and their pipelines; experience with Zuul, Jenkins, or Bamboo a plus.
  • DevOps experience working with Ansible, Puppet, or Chef a plus
  • Experience with containers and HA clusters; experience with Docker and Kubernetes a plus.
  • Excellent knowledge of TCP/IP networking.
  • Strong background in network engineering.
  • Hands-on data center operational experience.
  • Proven ability to collaborate and work well within a team.
  • Ability to communicate effectively both verbally and in writing.
  • At least 10 years of experience consulting

About Business Unit
IBM Services is a team of business, strategy and technology consultants that design, build, and run foundational systems and services that is the backbone of the world's economy. IBM Services partners with the world's leading companies in over 170 countries to build smarter businesses by reimagining and reinventing through technology, with its outcome-focused methodologies, industry-leading portfolio and world class research and operations expertise leading to results-driven innovation and enduring excellence.

Your Life @ IBM
What matters to you when youre looking for your next career challenge?

Maybe you want to get involved in work that really changes the world? What about somewhere with incredible and diverse career and development opportunities where you can truly discover your passion? Are you looking for a culture of openness, collaboration and trust where everyone has a voice? What about all of these? If so, then IBM could be your next career challenge. Join us, not to do something better, but to attempt things you never thought possible.

Impact. Inclusion. Infinite Experiences. Do your best work ever.

About IBM
IBMs greatest invention is the IBMer. We believe that progress is made through progressive thinking, progressive leadership, progressive policy and progressive action. IBMers believe that the application of intelligence, reason and science can improve business, society and the human condition. Restlessly reinventing since 1911, we are the largest technology and consulting employer in the world, with more than 380,000 IBMers serving clients in 170 countries.

Location Statement
For additional information about location requirements, please discuss with the recruiter following submission of your application.

Being You @ IBM
IBM is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.



  • Computers Software and Hardware
Posted: 2019-11-14 Expires: 2019-12-14

Featured Jobs

Career News

Before you go...

Our free job seeker tools include alerts for new jobs, saving your favorites, optimized job matching, and more! Just enter your email below.

Share this job:

Senior Site Reliability Engineer (SRE)

Multiple Cities, Multiple

Join us to start saving your Favorite Jobs!

Sign In Create Account
Powered ByCareerCast