Required experience includes:
10+ years experience
Ability to troubleshoot and diagnose low level Linux issues including kernel internals
Experience troubleshooting microservice interdependencies
MySQL, MongoDB, Cassandra or other database expertise (you name it, we probably use it)
AWS, Azure, or equivalent public cloud experience
Docker / Mesos / Kubernetes
Automation & Orchestration!
Previous experience mentoring and/or leading a team
Strong programming skills in ruby / python
You have some exposure to CI/CD and have been involved in improving pipelines
Ideally you can answer most of these questions:
What’s an availability zone?
Why does Munin not scale?
What’s worse, jitter or latency?
Why would you consider serverless architectures?
Do you always answer 'what if...?' questions when designing systems?
What is the best DB technology to use for geo replication?
What are the five why’s?
Have you developed a world where monitoring and deployment is automated and systems intelligently scale, heal and manage themselves?
What about the bonus skills?
Contributed to open source projects and are on GitHub/ServerFault/StackOverflow
Run or attend meetups on DevOps technologies
Extensive knowledge of H.323, SIP, Microsoft Lync and any other buzzworthy technology in video / voice conferencing
Experience with BGP, OSPF, ECMP, SDN other buzzworthy networking TLAs
Windows server administration
What would a typical day be like?
On-call rotates, just wanted to get that out of the way
Ops works closely with the dev/QA/product management teams, very closely, so you must play nice with others