Location(s): United States; Georgia; Alpharetta, Atlanta
About Us: GE is the world's Digital Industrial Company, transforming industry with software-defined machines and solutions that are connected, responsive and predictive. Through our people, leadership development, services, technology and scale, GE delivers better outcomes for global customers by speaking the language of industry. GE offers a great work environment, professional development, challenging careers, and competitive compensation. GE is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other characteristics protected by law.
Role Summary: Sr. DevOps Engineer will be responsible for performance and availability of Compute infrastructure consumed by all business segments. The Site Reliability teams are composed of highly talented individuals obsessively focused with availability through operational excellence. The ideal individual is relentlessly technical, passionate for automating everything and totally committed to delivering amazing customer experiences.
As a Sr. DevOps Engineer, you must have an excellent understanding of standard IT infrastructure equipment and systems – reliability and failure causes, the ability to quickly understand the key operational characteristics of new equipment and systems, interview domain experts for failure mode knowledge, and assess how possible failure models will affect measured parameters and key performance indicators (KPIs).
In catastrophic situations available 24x7 to quickly respond and resolve critical service outages severely impacting consumers.
Develop automated solutions to address potential problems before they result in a service interruption.
Build dashboards, bots, micro services to address the operational issues.
Utilize logging system such as Splunk (or ELK) and monitoring such as Sensu or similar to create a robust system.
Provide impact assessment and mitigation plan for changes going into the production environment.
Develop availability measures that align with consumer experience to accurately assess the usability of crucial services.
Programmatically monitor for and remediate configuration drift of critical devices.
Develop response plans to potential failure points and evaluate effectiveness during planned tests.
Perform comprehensive operational health checks of the entire services to identify areas of concern and track activities to drive improvements at all levels of the architecture.
Provide technical coaching and direction to more junior teammates Qualifications/Requirements
Bachelor's degree in Computer Science, Information Management, similar STEM degree, or in lieu a high school diploma with equivalent years of experience.
Minimum 5 years of Dev Ops experience in enterprise-wide deployments.
1 to 2 years’ experience in AWS – creating payloads, using AWS tools for Dev Ops pipeline and data analytics
2 to 3 years’ experience in developing software systems using micro services architecture (Docker or NetFlix or Kubernetes)
Legal authorization to work in the US is required. GE may agree to sponsor an individual for an employment visa now or in the future if there is a shortage of individuals with particular skills.
Experience working with Operating systems (Windows, Unix/Linux) from kernel to shell and beyond, with experience working with system libraries, file systems, and client-server protocols.
Networking: experience with network theory e.g. TCP/IP, UDP, ICMP, etc., MAC addresses, IP packets, DNS, OSI layers, and load balancing
Demonstrated experience scripting or developing software and services for the cloud with one or more of the following automation scripting coding languages (Shell, Powershell, Ansible, Python, Chef, Puppett, Bash, Ruby, Python, Go, Java, Node.js, .NET, etc.)
Experience managing version control systems such as Git
Experience deploying and managing infrastructure on public clouds such as AWS
Experience using an automated configuration management system (Terraform, Chef, Puppet, Ansible, Salt, etc.)
Engineer should be able to design and develop large scale systems. Aware of design patterns and design practices
Knowledge of Network Management (SNMP, MIB)
Experience with configuring, customizing, extending
We are in the process of transitioning to an improved job application system and in the interim we are operating with two systems. Have your Job ID ready (from the email you received when you applied) to log in and check your application status.
Click the appropriate button. If you don't know your job ID, you can still check your status: use both buttons.