About Us: GE is the world's Digital Industrial Company, transforming industry with software-defined machines and solutions that are connected, responsive and predictive. Through our people, leadership development, services, technology and scale, GE delivers better outcomes for global customers by speaking the language of industry. GE offers a great work environment, professional development, challenging careers, and competitive compensation. GE is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other characteristics protected by law.
Role Summary: As a member of the High Performance Computing as a Service (HPCaaS) team, this role will focus on providing global operational support to stakeholders who use HPCaaS to deliver outcomes. You will implement robust solutions for real-time monitoring and management of the production infrastructure, related telemetry and reporting as well as ensuring SLAs are achieved. You will drive excellence while identifying opportunities to for global consistency in the HPCaaS software, applications and tools.
Essential Responsibilities: In this role you will:
Provide excellent global customer/stakeholder support in a collaborative, consultative style with a constant focus on operational excellence.
Install, configure, manage, run and support large scale Cray supercomputers, Linux-based clusters and related Lustre/NAS storage, visualization and data management services.
Support and maintain a broad spectrum of engineering software packages aligned to technical disciplines such as finite element analysis, computational fluid dynamics and lifing analytics or equivalent.
Build, manage and execute complex infrastructure management and operational projects (i.e. installation, upgrades, and migration/decommission.)
Drive the development of stakeholder solutions which are service-oriented with reusable components that can be leveraged in different ways to meet globally diverse business needs from a broad stakeholder community.
Evolve current legacy service and solution offerings into next generation capabilities aligned to new and emerging business needs.
Embrace change and promote the identification of service improvement opportunities across the global team. Aggressively pursue plans/take action to execute with a continuous improvement mindset.
Identify opportunities to drive the automation of repeatable processes, efficiency and to remove non-value added tasks and overhead from team activities.
Embrace, utilize and promote robust change management processes.
Drive effective written and spoken communication within the team and externally to stakeholders.
Support the adjacent development of next generation workload management and job scheduling tools aligned to and integrated with technical focus areas such as: containerization, cloud services, machine learning, artificial intelligence and big data services.
Partner with leadership and Architects to ensure that service quality meets and exceeds stakeholder expectations.
Balance cost, quality, performance, capability and productivity service variables effectively.
Lead the research and evaluation of new and innovative tools, techniques and strategies aligned to infrastructure management.
Support project teams as well as Principal Architects in developing future technology and solution roadmaps.
Technically mentor junior employees within the function.
Be available to respond to service availability issues and drive resolution in a 7x24 model.
Bachelor’s Degree in Computer Science or in “STEM” Majors (Science, Technology, Engineering and Math).
A minimum of 5 years of professional experience in High Performance Computing (HPC) and/or Simulation Based Engineering and Science (SBES).
A minimum of 5 years providing technical HPC operational support and managing globally distributed end-to-end technical supercomputing and Linux-based cluster solutions (compute, storage, visualization.)
Legal authorization to work in the U.S. is required. We will not sponsor individuals for employment visas, now or in the future, for this job.
Must be willing to work out of an office located in Niskayuna, NY.
Desired Characteristics: Technical Expertise:
Demonstrated ability to design, develop and maintain creative solutions to complex software, engineering toolchain and application problems.
Mission critical systems management experience with enterprise-class compute, storage, network, virtualization and cloud service technologies.
Experience working directly with, administering and maintaining global, large scale Cray supercomputing, Linux compute cluster and Lustre-based storage services, related VMware visualization and data management systems.
Experience working directly with cloud providers/infrastructure (Amazon, Microsoft, etc.), database technologies, operating systems (Windows, Linux) and orchestration tools (Chef, Puppet, etc.)
Experience implementing, scaling, managing and administering infrastructure monitoring and management tools (i.e. NAGIOS, Splunk, HP Openview, Oracle Enterprise Manager, OpsView or equivalent.)
Experience working with emerging HPC GPU technologies, Machine Learning frameworks, Slurm, Hadoop/Big Data frameworks, Docker/Singularity containerization, etc.
Software development exposure – hands-on with programming or scripting languages (preferably shell/perl/python or Java.)
Experience with database / data warehouse technologies and strategies to address management and reporting of metrics, telemetry, asset utilization, etc.
Experience with engineering software aligned to finite element analysis, computational fluid dynamics and lifing analytics or equivalent.
Experience delivering complex technical projects focused on infrastructure management and operations.
Proven experience working with and managing supplier relationships.
Strong analysis and problem-solving skills.
Ability to interact at all levels of the organization and with other GE businesses across cultural, geographic and business boundaries.
Strong understanding of software governance and compliance/regulatory requirements.
Ability to work with cross-functional teams to build effective processes.
Relentless drive and desire for continuous improvement. Challenges the status quo and pursues opportunities to drive service evolution, quality and efficiency while removing waste and non-value added work from team activities.
Proactively identifies and removes project obstacles or barriers on behalf of the team.
Articulates the story; uses two-way communication and influences outcomes and on-going results.
Strong oral and written communication skills including executive level presentation skills.
Effective analytical, problem solving and technical leadership skills.
Self-motivated. Should be able to operate with minimal management supervision.
Demonstrated ability to deliver on commitments to stakeholders.
Demonstrated interpersonal and global teaming abilities.
Works well in a fast paced, agile, adaptive environment.
Wiling to adapt to change, and learn new tools, technologies and processes as needed.
Proactively engages with cross-functional teams to resolve issues and design solutions using critical thinking and analytical skills and best practices.
Process oriented; results, quality, cost driven.
Ability to prioritize and manage multiple complex, competing priorities simultaneously.
We are in the process of transitioning to an improved job application system and in the interim we are operating with two systems. Have your Job ID ready (from the email you received when you applied) to log in and check your application status.
Click the appropriate button. If you don't know your job ID, you can still check your status: use both buttons.