ASML Sr DevOps Engineer in San Jose, California
Introduction
ASML brings together the most creative minds in science and technology to develop lithography machines that are key to producing faster, cheaper, more energy-efficient microchips. We design, develop, integrate, market and service these advanced machines, which enable our customers, the world’s leading chipmakers, to reduce the size and increase the functionality of their microchips, which in turn leads to smaller, more powerful consumer electronics. Our headquarters are in Veldhoven, the Netherlands, and we have 18 office locations around the United States including main offices in Wilton, CT, Chandler, AZ and San Jose, CA.
We are proud of bringing together the brightest minds from all over the world to tackle the most challenging problems in making faster and smaller chips. As part of our team, you will have the opportunity to go beyond yourself in developing more advanced techniques and pushing the boundaries of technology.
ASML San Jose, CA is an industry leader in computational lithography for integrated circuits. Our Tachyon platform enables capabilities that address chip design, photomask making and wafer printing for semiconductor manufacturing, and the LithoTuner platform enables lithography optimization for ASML’s leading photolithography tools, known as ‘scanners’.
Job Mission
Do you want to be at the core of the world’s computing devices? Do you want to be at the core of a software development company that enables our customers to design the world’s computing devices? Do you have the desire, knowledge, skills and abilities to set up and maintain over 1,000 physical and virtual Linux machines that work together to make the world a better place? Do you want to be a part of making Moore’s law a reality instead of just reaping the benefits? Do you want to be a part of an agile team with the highest integrity and open knowledge sharing, one that is bold and fearless in taking calculated risks to create systems that work together to make tomorrow’s computing systems today? If so, come work within the High Performance Computing team within the Development Operations group at ASML/Brion.
Job Description
We are a diverse and growing team of engineers with many talents that specialize in one of three areas. These areas are Software Configuration Management (SCM), High Performance Computing (HPC), and Tooling (tools that we use internally and as a company to get our job done efficiently and reliably).
This position is for the HPC side of DevOps with about 60-70% system administration and 30-40% software development.
Manage and maintain a high performance computing (HPC) cluster (over 14K cores) for development and testing
Manage batch systems (IBM LSF, PBS Pro, Slurm)
Manage object storage backup system (Atempo)
Troubleshoot and find root causes of application and system problems
Perform configuration management via Puppet, Ansible, and custom Python and Bash scripting
Develop internal tools to be used alongside the batch systems and Puppet
Develop monitoring and availability metric systems
Develop applications that connect to SQL and NoSQL database systems (MySQL, CouchDB, MongoDB)
Document and share internal policies and procedures for working with our systems and tools
Education
Bachelor's or Master's degree in Computer Science, Engineering, MIS or a related field, or equivalent experience
Experience
Strong working knowledge of programming languages and data structures
Working knowledge of scientific computing
Experience in installing Linux on physical and virtual machine environments
Ability to automate processes by authoring and executing scripts in Bash and Python
Experience with managing source code repositories, including branching and merging code
Customer-centric interpersonal skills and effective verbal and written communication abilities
Experience with configuration management tools such as Puppet, Chef or Ansible
Network management experience - configuring and maintaining Ethernet and InfiniBand network devices
Low level network tracing using libpcap-derived tools
Continuous integration and continuous deployment experience
Software development in compiled languages such as C, C++, and Go
Knowledge of networked filesystems (NFS)
Knowledge of parallel networked filesystems such as Lustre or GPFS
Experience with distributed systems and applications
Personal skills
Solid communication skills in verbal and written English
Context of the position
This position primarily works in an office environment. It requires frequent sitting, standing and walking. Daily use of a computer is required. May stand for extended periods when facilitating meetings. The physical demands of the position described herein are essential functions of the job and employees must be able to successfully perform these tasks for extended periods. Reasonable accommodations may be made for those individuals with real or perceived disabilities to perform the essential functions of the job described.
Other information
EEO/AA (W/M/Vets/Disability) Employer
Location: San Jose - CA, US
Experience: 0-2 Starter
Available since: 4/13/2018
Functional area: Research & Development, Mechanical Engineering, Electrical Engineering
Background: Software Engineering, Information Technology, Computer Science
Reference: req2307