Systems Engineer, Site Reliability Engineering

Google

  • UK
  • Not disclosed
  • Permanent full-time
  • Updated 08/02/2013
  • HR Operations
This job has expired.

Description

This position is based in London, UK.

The area: Technical Infrastructure

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We're always on call to keep our networks up and running, ensuring our users have the best and fastest experience possible.

The role: Systems Engineer, Site Reliability Engineering

Google.com Engineering makes Google's services fast and reliable for hundreds of millions of users. Described as "software engineering for adrenaline junkies", the team combines software development, networking, and systems administration expertise to build and run massively distributed, fault-tolerant software systems and infrastructure. We hire technology mavens who love being in the center of the action, and we routinely tackle complex software and systems issues ranging from distributed change propagation on live serving systems, to designing and deploying cost-aware load balancing systems for the largest user-facing service in the world.

You are a frontline firefighter, helping the team reach its end goal by tackling some of the major issues that arise and making sure they are fixed.

Competitive rates of pay apply. Closing date: 20 November 2012

Responsibilities:

  • Manage availability, latency, scalability and efficiency of Google services by engineering reliability into software and systems.
  • Respond to and resolve emergent service problems and build automation tools to prevent problem recurrence.
  • Participate in service capacity planning and demand forecasting, software performance analysis and system tuning.
  • Review and influence ongoing design, architecture, standards and methods for operating services and systems.

Minimum qualifications:

  • BA/BS in Computer Science or a related field (or, in lieu of a degree, relevant skills or equivalent experience).

Preferred qualifications:

  • Experience with Unix systems administration including solid scripting skills in Shell, PHP, Perl, or Python.
  • Expertise in data structures and algorithms.
  • Expertise in analyzing and troubleshooting large-scale distributed systems.
  • Knowledge of IP networking and of analyzing network, performance, and application issues with standard tools such as tcpdump.
  • Tack-sharp analytical abilities, coupled with a strong sense of ownership, urgency, and drive.
  • Ability to handle periodic on-call duty as well as out-of-band requests.

Ref: IJ-10019