Position Overview
At Autodesk, we are reinventing how everything you see or touch is imagined, designed and created. In AEC Construction, we are reinventing how every building, bridge, hospital, campus and stadium is built. Our web and mobile applications bring 3D collaboration solutions to the construction industry. They bring real-time predictive insights and analytics to our customers, helping them better plan and execute their multi-million-dollar building projects while mitigating risk. We do this by building, implementing and applying machine learning, data science and real-time data engineering to the Big Data generated by the Autodesk ecosystem. Not only do we help push the boundaries of technology at Autodesk, we do it in style and have fun along the way. Intrigued? We are seeking a Principal DevOps Engineer. Come join us and push code to customers on your first day.
Responsibilities
- Work with product development teams to improve performance and reliability for new feature development
- Design, write, document, and deliver software to improve product availability, scalability, and performance
- Develop, train and mentor teammates
- Automate the installation and upkeep of build and deploy tools and dependencies
- Evaluate current build, release and hosting infrastructure and implement enhancements as required
- Define processes, techniques, architectures, and practices to ensure the monitoring, reliability and performance of Autodesk products and services
- Design and implement innovations that improve software engineering velocity, infrastructure resiliency, security, and data availability
- Coordinate and perform major upgrades with zero downtime
- Solve live performance and stability issues and prevent their recurrence
- Work with highly skilled subject matter experts in a follow-the-sun on-call rotation
- Mentor and influence Software Engineering leadership and engineers by motivating improvements
- Help define and build true CI/CD culture, tooling and process
Minimum Qualifications
- Curiosity about how things work
- Obsession with automating everything and a passion for elegance and simplicity
- Advanced knowledge of Unix/Linux systems
- Scripting expertise and adeptness with the command line
- In-depth understanding of web operations best practices
- Experience with configuration management
- Experience managing an infrastructure with hundreds or thousands of servers and dozens of technologies
- Ability to jump into situations with few guardrails and make things better
- Ability and desire to jump in and fix things when they break
- Extended experience with AWS, CloudFormation, Jenkins and CI/CD best practices
- Experience with container infrastructures - Docker, Kubernetes, Mesos
- A love of learning and developing new skills
- 10+ years of Site Reliability work experience, with concrete examples of things you've worked on
- Ability to learn rapidly
- Self-motivated, autonomous and empathetic, with great communication skills
Preferred Qualifications
- Advanced expert role, requiring deep subject-matter knowledge and sound business acumen to advise leaders
- For jobs at this level, knowledge of the business starts to become as important as subject-matter knowledge
- Incumbents translate specialized subject-matter requirements into business needs/requirements
- Requires depth of specialized expertise to interpret internal/external business issues and recommend best practices to address them
- Solve complex problems that require in-depth evaluation of variable factors by taking a broad perspective to identify the best approach and innovative solutions
- Work independently, with close guidance in only the most complex or unusual situations
- Adapt communication style to persuade various stakeholders across the business and frequently interact with senior leaders to provide advice and subject-matter expertise
- May lead functional teams or programs within own function (or closely related functions)