16 days old

Site Reliability Engineer 2

Austin, TX 78701
  • Job Code: 580821
  • Payrate: $80,000 to $110,000

Job Description


As a Site Reliability Engineer, you will work with other SREs, engineers, and developers to ensure maximum performance and availability of our database services and infrastructure. Our ideal Site Reliability Engineer is familiar with both software and systems engineering and wants not just to resolve a problem but to prevent it from recurring.


Responsibilities:



  • Design and architect operational solutions for managing applications and infrastructure, across data centers and cloud providers with the specific goal of increasing the automation, repeatability, and consistency of operational tasks.
  • Design new monitoring and alerting tools that help discover failures in a timely fashion, and work with engineers to identify root causes and fix issues.
  • Provide basic network administration and troubleshooting.
  • Support and perform maintenance across product and data environments/systems.
  • Create scalable alerting and auto remediation systems.
  • Perform capacity planning for various services.
  • Design, write and deliver monitors and dashboards that improve predictability and are actionable in a proactive manner.
  • Handle day-to-day operational management, including response, incident, event, and problem management activities, along with tier-two support.
  • Participate in on-call rotation duties.


Qualifications



  • Experience with Linux systems administration and tuning.
  • Understanding of TCP/IP networking.
  • Experience in one or more of: Python, Ruby, Go.
  • Experience with automation tools: Puppet, Chef, Docker, Jenkins, and/or Ansible.
  • Understanding of, and hands-on experience implementing, Docker and other container-based systems.
  • Strong passion for automation, testing, and code quality.
  • Experience with public cloud providers (AWS, Azure, Google Cloud Platform).
  • Comfort with collaboration, open communication, and remote teams.


Bonus points



  • Experience using Prometheus and Grafana.
  • Experience with cluster managers like Mesos or Kubernetes.
  • A habit of treating infrastructure and automation as code.

Categories

  • Information Technology

Randstad uses a technology-driven focus with a human touch to provide better staffing and business solutions to organizations around the world. Our team of experts matches professionals with available career opportunities in a variety of fields.

Randstad Technologies