Sr. Production Engineer (MLOps)
Yahoo
This job is no longer accepting applications
See open jobs at Yahoo.See open jobs similar to "Sr. Production Engineer (MLOps)" SOAR Kentucky.A little about us:
The Production Engineering (PE/SRE/DevOps/MLOps) team is the power behind engineering goodness for all above. By writing, designing and implementing software and infrastructure to drive velocity, operability, reliability and performance, the PE team ensures continuous quality and compliance on production systems. We also build and maintain tools and platforms for engineer productivity and are at the forefront of tech initiatives of infrastructure modernization with cloud platforms and experimenting with new technologies.
Consumer Monetization team’s charter is to find, evaluate, build, and scale new monetization, subscription and internal campaign tools and products, ad formats and functionalities across all Yahoo brands including Yahoo Homepage, Yahoo Sports, Yahoo Finance, Yahoo News and AOL. This team is uniquely positioned to identify growth and revenue generation opportunities, design and implement solutions across consumer products and advertising platforms including video, display, native, and search.
Yes, we are DevOps! We are all about:
Enabling a culture of ownership and excellence.
Engineering processes that are Automated and Agile.
Developing tools that are Self-Serve and (Re)Usable.
Efficiently bring products to market.
Proactively prevent defects from reaching customers.
Swiftly address and resolve any production issues.
About the Role
The Monetization Production Engineering team is seeking experienced DevOps/Cloud Infrastructure engineers with expertise in GCP and a strong knowledge of machine learning frameworks. If you are passionate about working in a dynamic environment and have the required skills, we would love to have you on board with us. Apply now to join our team!
A Lot About You
We are looking for DevOps, Production Engineers (SRE's) who are problem solvers at heart, with solid ability to dig into code and own the reliability domain. As a member of the PE team, you will work with your developer partners and implement operability improvements, security, infrastructure, automation, CI/CD, monitoring and other system requirements. You will also work in development of tools, automation platforms and intelligent monitoring.
You will thrive here if you:
Are a self starter with a passion for solving difficult technical problems, from the network to the application stack
Like to relentlessly automate everything and anything at scale (tens of thousands of servers)
Want to make web applications and backend systems faster, more reliable, more efficient
Can handle fast-paced projects, enjoy having a broad and deep technical knowledge and are driven to iterate with new technologies
Feel comfortable with troubleshooting issues, incident management and doing on-call
Interested in cutting edge technologies, open source and taking products from design to production
Responsibilities
Manage large scale distributed systems by analyzing the existing processes and systems to identify areas of improvement.
Collaborate with the engineering team on the product roadmap and provide support in terms of capacity planning and scaling, operability review, monitoring and alerting, process optimization, security and compliance.
Work on challenging system and network problems to improve system performance and reliability
Develop and maintain tools and frameworks for monitoring and logging of machine learning models in production.
Collaborate with data engineers to ensure efficient data ingestion, transformation, and storage for machine learning applications.
Participate in on-call rotation, identify and resolve technical issues that arise in the production environment by closely working with engineering teams or platform teams.
Stay updated with industry trends and best practices in DevOps, MLOps, and cloud technologies.
Minimum Qualifications
BS/MS in Computer Science or equivalent degree
At least 6+ years experience in DevOps/ PE / SRE and Software Development roles
At least 2+ year of experience in containerization and orchestration technologies (e.g. Docker, Kubernetes), and Google Cloud Platform (GCP)
Intermediate level of coding expertise in one or more language including Java, Nodejs, Python, or Go
Experience with Infrastructure-as-code tools (ex. Terraform).
Proven experience in designing and implementing machine learning infrastructure and pipelines.
Experience with data engineering tools and frameworks
Familiarity with machine learning frameworks (e.g., TensorFlow, PyTorch) and model training workflows, deployment techniques.
Good knowledge of TCP/IP and networking
Preferred Qualifications
Experience in designing, managing large scale infrastructure in GCP, AWS EKS, AWS Open search, multi zone, multi region deployments.
Deep understanding of UNIX/Linux system internals and tools for troubleshooting application stack dumps and networking
Experience working with GitHub Actions
Experience with software based load balancers like Ngnix
Experience with system reliability tools like Open telemetry, Prometheus, Splunk, ELK
Experience with storage solutions like REDIS, DynamoDB.
Experience in deploying and managing machine learning models in a production environment.
Yahoo is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. Yahoo is dedicated to providing an accessible environment for all candidates during the application process and for employees during their employment. If you need accessibility assistance and/or a reasonable accommodation due to a disability, please submit a request via the Accommodation Request Form (www.yahooinc.com/careers/contact-us.html) or call 408-336-1409. Requests and calls received for non-disability related issues, such as following up on an application, will not receive a response.
At Yahoo, we know that diversity makes us stronger. We are committed to a collaborative, inclusive environment that encourages authenticity and fosters a sense of belonging. We strive for everyone to feel valued, connected, and empowered to reach their potential and contribute their best. Check out our diversity and inclusion (www.yahooinc.com/diversity/) page to learn more.
The compensation for this position ranges from $128,250.00 - $266,875.00/yr and will vary depending on factors such as your location, skills and experience. The compensation package may also include incentive compensation opportunities in the form of discretionary annual bonus or commissions, in addition to equity incentives. Yahoo provides industry-leading benefits including healthcare, 401K savings plan, company holidays, vacation, sick time, parental leave and an employee assistance program. Eligibility requirements apply.Yahoo has a high degree of flexibility around employee location and hybrid working. In fact, our flexible-hybrid approach to work is one of the things our employees rave about. Most roles don’t require specific regular patterns of in-person office attendance. If you join Yahoo, you may be asked to attend (or travel to attend) on-site work sessions, team-building, or other in-person events. When these occur, you’ll be given notice to make arrangements.
If you’re curious about how this factors into this role, please discuss with the recruiter.
Currently work for Yahoo? Please apply on our internal career site.
This job is no longer accepting applications
See open jobs at Yahoo.See open jobs similar to "Sr. Production Engineer (MLOps)" SOAR Kentucky.