San Francisco • Financial District
What will you do?
The Engineering Operations team is responsible for the health and reliability of our engineering infrastructure (100% in Cloud Computing platforms), including our Continuous Delivery pipeline, Microservices cluster, and Docker fleet. Members of this team work to ensure our service is secure, scalable, and provides rock-solid reliability, while still working to keep our feature teams delivering quickly and efficiently.
- Participate in managing development and production infrastructure environments.
Configure and manage deployments to our Amazon Web Services infrastructure.
Design and implement the intelligent evolution of our cloud architecture, including capacity planning, increased availability, resiliency, monitoring and alerts, security, and deployment strategy.
Grow and extend our monitoring and alerting systems.
Develop and document new processes, and make it a priority to assist in knowledge sharing with the
team and the company.
Build new tools and services to help improve the developer experience.
Participate in on-call rotation.
Help be a driving force for improvements.
Must have direct experience with the following:
- Managing Production Linux or Unix systems on one or more cloud computing platforms (such as Amazon Web Services or Google Cloud Platform).
Using configuration management software (such as Chef, Puppet, or Ansible) to deliver software or files to multiple systems or classes of systems.
Using Version Control Software (such as Git, Mercurial, or Subversion) to work on a multi-contributor code project.
Troubleshooting hardware or virtualized infrastructure issues as they occur, and responding in a calm, rational manner.
Contributing to, and authoring, documentation based on what you have experienced and learned.
Automating the more tedious parts of your current role away, or ability to explain how you would do so.
Using container technology (such as Docker) to run multiple services independently on the same hardware.
Additional preferred experience:
Worked with large datasets and storage needs, in the range of Petabytes or larger.
Have integrated into multiple engineering teams, and can integrate into different workflows and toolsets easily.
Worked with, and feel confident reading and contributing to, multiple interpreted and compiled programming languages.
Used scheduling systems (Such as Apache Mesos or Kubernetes) to coordinate service availability.
Have a strong understanding of security best practices in the Public Cloud (such as Amazon Web
Services or Google Cloud)
Competitive salary and bonus program
Exceptional vacation and holiday plan
Competitive medical benefits
Employee equity and 401(k) plans
Wellness program, including in-office yoga and massages
Downtown SF - close to BART, Muni, Caltrain
Commuter Benefit Plan
Active Fun Team with full calendar of social events
Commitment to professional development
NOTE: San Francisco Bay Area candidates only please. There is no relocation assistance or agency fee budget available for this position. Candidates from third party firms will NOT be considered for this opening. Candidates requiring sponsorship will not be considered at this time. Resumes received from a third party will be considered as no-fee referrals. Our company is committed to Equal Employment Opportunity.