Site Reliability Engineer (Infrastructure Team)

Company: Assurit
Location: Mountain View
Posted on: July 31, 2022

Job Description:

Apply today by submitting your resume to careers@assurit.com, and view our careers page for more information: https://www.assurit.com/careers/

Assurit is currently seeking an experienced Site Reliability Engineer to support one of our clients.

Candidates must be local to, or willing to relocate to, Los Angeles, San Francisco, or Seattle, and willing to work onsite once the return to office occurs. The return to office isn't expected for at least another 4-6 months. Once it occurs, team members will be advised that they have 30 days to relocate.

Responsibilities:

  • Design, write and deliver software and automation to dramatically improve the availability, scalability, latency, and efficiency of infrastructure
  • Improve system design and architecture to ensure high stability and performance of services across multiple data centers worldwide
  • Manage operations of data services and real-time/batch data pipelines, including SLA management, system deployment, performance tuning, on-call support, and troubleshooting
  • Perform lifecycle management of production systems including change management, service deployment, operations and emergency response
  • Provide strong support during major events to ensure the system can handle large volumes of Internet traffic
  • Manage infrastructure services, including but not limited to deployment, operation, and troubleshooting
  • Work with the team to establish service level objectives and monitor them to ensure the objectives are met
  • Continually improve cloud operations automation and tooling to monitor and maintain enterprise cloud-based infrastructure
  • Execute automation for known cloud-operations tasks, and create new automation for new situations or issues you encounter; automate everything
  • Facilitate blame-free root cause analysis meetings in the event of a production-systems incident so that the team can learn from mistakes and improve our systems and runbooks
  • Be vigilant about security and adhere to best practices to secure our cloud infrastructure and real-time platform

Minimum Qualifications:

  • Ability to code in Python
  • Linux administration (system administration and network configuration)
  • Debugging and troubleshooting production performance issues (application and infrastructure)
  • Knowledge of message queue tools (Kafka, RabbitMQ)
  • Kubernetes administration
  • CI/CD tooling
  • DevOps automation

Preferred Qualifications:

  • Shell scripting
  • Knowledge of containers
  • Exposure to distributed systems (Consul, ZooKeeper, MongoDB)
  • Knowledge of SaltStack
  • Knowledge of monitoring tools (Grafana, Prometheus)
