Systems Reliability Engineer, E-Commerce

Location: SeaTac, WA
Date Posted: 02-15-2018
Our Company is seeking an experienced Systems Reliability Engineer (similar to a Site Reliability Engineer) to be responsible for the reliability, resiliency, and performance of the technology systems supporting our multibillion-dollar, multi-channel e-commerce business. These systems include alaskaair.com, customer mobile apps, the loyalty system, and our back-end tier of high-scale services. This engineer will also enhance proactive monitoring, automation, and overall system health.

The ideal candidate will have hands-on coding/scripting experience in infrastructure automation and in instrumenting health monitors. They can build creative engineering solutions to operational problems. They are familiar with DevOps culture and work to spread it to their own team and others. They understand agile development values and practices, including small, iterative, frequent, and continuous value delivery.

RESPONSIBILITIES

Be a Member of a High-Performing Team
• Be part of the functional team that owns Tier 2 and Tier 3 support for all e-commerce systems
• Drive a continuous improvement mindset within the team, embracing a DevOps culture by automating everything possible and constantly finding ways to make our systems more reliable
• Stay aware of, experiment with, and adopt emerging industry practices in the systems operations space

Increase System Reliability & Transparency
• Build dashboards, alerting, and monitoring for existing systems so that internal teams know about issues before they impact the customer
• Practice, coach, and evangelize reliability best practices
• Work with product teams to establish SLAs around performance that can then be integrated into our monitoring/alerting solutions
• Automate existing manual processes and provide more self-service functionality to the Tier 2 team
• Develop engineering solutions to repetitive failures and all other problems that adversely affect production systems

Practice Agile Values and Instill DevOps Culture
• Practice agile principles to organize and deliver work
• Bring modern delivery practices to legacy systems
• Enable software development teams to continuously push their code to production
• Help build container-based software delivery to production

QUALIFICATIONS

• Bachelor's degree in Computer Science or a similar technical field strongly preferred
• 2+ years of hands-on software development experience required
• 2+ years of Reliability Engineering experience required
• Experience with Git is required
• Proficiency in infrastructure scripting and in configuring Chef, Bamboo, Jenkins, or similar products
• Experience with Linux/AWS/Windows Azure necessary
• Experience working in an agile environment necessary
• Expertise in automation tools (such as Jenkins or Chef), as well as monitoring tools (such as AppDynamics, App Insights, Sumo Logic, etc.) required
• Expertise in incident and problem management, including timely problem identification, successful resolution, and root-cause analysis, required
• Strong verbal and written communication skills to communicate technology concepts and practices
• Experience working in a high-scale, high-traffic, 24/7 environment required