Systems Development Eng (AWS Generative AI & ML Servers), AWS Hardware Engineering Accelerators
Company: Amazon
Location: Cupertino
Posted on: April 5, 2026
|
|
|
Job Description:
Do you want to build the backbone of Generative AI cloud at AWS?
Do you want to build the future of the cloud for AI training and
inference? Want to do industry leading work delivering continuous
price performance improvements in the cloud for AI model training
for multi billion variable LLMs? Come Join us in designing,
delivering and operating AWS cloud offerings that enable high
performance and scalability in AI/ML and HPC workloads. You are
intrigued by the continuous release of newer AWS services and
instance types that solve newer, bigger and more interesting
business problems every day? Does that make you wish your talents
were applied to those at cloud scale? If yes, then come join us -
we are looking for builders like you. The AWS Hardware Engineering
team creates server designs for Amazon’s innovative web services.
Our designs are industry-leading in frugality and operational
excellence, and are critical to the success of the AWS business and
millions of customers who use AWS today. Our engineers solve
challenging technology problems, and build architecturally sound,
high-quality components to enable AWS to realize critical business
strategies. The ideal candidate for this role will be an innovative
self-starter. You are knowledgeable of the full technical stack -
vertically from baremetal server hardware up to the software in
userland, and everything in the middle. You have tremendous
interest in cloud scale and curious how systems and software
decisions impact the user. You insist on highest-standards and are
able to develop tactical solutions/tools to diagnose and fix
issues. You are an excellent systems debugger - finding interaction
issues between components on server systems. You are a leader with
strong organizational, planning, and communication skills. You are
a builder! What you will do? You will work with engineers across
the company for delivering the next-generation AWS platforms. You
will have a direct impact on our bottom line and the ability to
deliver improvements for AWS. You will be part of a growing, fast
paced, and fun team. You will have ownership for the implementation
of your work. You will see direct product improvements based on the
results of your work. AWS Engineers are shaping the way people use
computers and designing the future of cloud computing technology –
come help us make history! Why it matters? Public cloud IT services
represent the majority of growth in the overall IT services market
and will continue to do so for several years to come. The scale of
AWS, combined with an understanding of how our software and
hardware is used, creates a unique opportunity for component
customizations that will directly benefit our customers. Why you
will love it? You will work with engineers across the company for
delivering the next-generation AWS platforms. You will have a
direct impact on our bottom line and the ability to deliver
improvements for AWS. You will be part of a growing, fast paced,
and fun team. You will have ownership for the implementation of
your work. You will see direct product improvements based on the
results of your work. Key job responsibilities You will be a
technical leader solving complex architectural problems which may
not defined before hand. You will be owning the teams systems and
work proactively in identifying deficiencies, writing tactical code
to solve issues before they impact customers, and working with your
team to scale the solution. You will decompose big difficult server
system testability, reliability and diagnosis problems into
straightforward tasks, components or features that you will lead to
deliver yourself and through others in parallel. You will use
combination of hardware, software, system designs, x86
architecture, processes, diagnosis and operations knowledge. A day
in the life Working with a variety of job roles (SDEs, SDETs,
Hardware Engineers, TPMs, Managers, Principals) and groups (AWS
Hardware Engineering, EC2, other AWS services) through server
conception, test, launch, and operations. Driving high quality and
reliability into future/new designs for AWS Accelerated server
solutions for AWS Cloud. About the team The Hardware Engineering AI
/ ML development team is a group of engineers and technical program
managers directly responsible for launching hardware in the fleet.
Located out of Seattle, Cupertino and Austin we work on programs
with global development teams (both internal and external to
Amazon). Our servers are located in datacenters globally. Why AWS
Amazon Web Services (AWS) is the world’s most comprehensive and
broadly adopted cloud platform. We pioneered cloud computing and
never stopped innovating — that’s why customers from the most
successful startups to Global 500 companies trust our robust suite
of products and services to power their businesses. Utility
Computing (UC) AWS Utility Computing (UC) provides product
innovations — from foundational services such as Amazon’s Simple
Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to
consistently released new product innovations that continue to set
AWS’s services and features apart in the industry. As a member of
the UC organization, you’ll support the development and management
of Compute, Database, Storage, Internet of Things (IoT), Platform,
and Productivity Apps services in AWS, including support for
customers who require specialized security solutions for their
cloud services. Inclusive Team Culture Here at AWS, it’s in our
nature to learn and be curious. Our employee-led affinity groups
foster a culture of inclusion that empower us to be proud of our
differences. Ongoing events and learning experiences, including our
Conversations on Race and Ethnicity and AmazeCon conferences,
inspire us to never stop embracing our uniqueness. Work/Life
Balance We value work-life harmony. Achieving success at work
should never come at the expense of sacrifices at home, which is
why we strive for flexibility as part of our working culture. When
we feel supported in the workplace and at home, there’s nothing we
can’t achieve in the cloud. Mentorship and Career Growth We’re
continuously raising our performance bar as we strive to become
Earth’s Best Employer. That’s why you’ll find endless
knowledge-sharing, mentorship and other career-advancing resources
here to help you develop into a better-rounded professional.
Diverse Experiences Amazon values diverse experiences. Even if you
do not meet all of the preferred qualifications and skills listed
in the job description, we encourage candidates to apply. If your
career is just starting, hasn’t followed a traditional path, or
includes alternative experiences, don’t let it stop you from
applying. - 6 years of programming with at least one modern
language such as C++, C#, Java, Python, Golang, PowerShell, Ruby
experience - 5 years of non-internship professional software
development experience - 5 years of designing or architecting
(design patterns, reliability and scaling) of new and existing
systems experience - 4 years of systems development in an IT or
data center environment experience - 4 years of deploying and
operating in a Linux/Unix environment experience - 5 years of
systems design, software development, operations, automation, and
process improvement experience - Experience leading the design,
build and deployment of complex and performant (reliable and
scalable) software solutions in production - Knowledge of
engineering practices and patterns for the full
software/hardware/networks development life cycle, including coding
standards, code reviews, source control management, build
processes, testing, certification, and livesite operations -
Experience taking a leading role in building complex software or
computing infrastructure that has been successfully delivered to
customers - Experience using managed ML/AI solutions Amazon is an
equal opportunity employer and does not discriminate on the basis
of protected veteran status, disability, or other legally protected
status. Los Angeles County applicants: Job duties for this position
include: work safely and cooperatively with other employees,
supervisors, and staff; adhere to standards of excellence despite
stressful conditions; communicate effectively and respectfully with
employees, supervisors, and staff to ensure exceptional customer
service; and follow all federal, state, and local laws and Company
policies. Criminal history may have a direct, adverse, and negative
relationship with some of the material job duties of this position.
These include the duties and responsibilities listed above, as well
as the abilities to adhere to company policies, exercise sound
judgment, effectively manage stress and work safely and
respectfully with others, exhibit trustworthiness and
professionalism, and safeguard business operations and the
Company’s reputation. Pursuant to the Los Angeles County Fair
Chance Ordinance, we will consider for employment qualified
applicants with arrest and conviction records. Our inclusive
culture empowers Amazonians to deliver the best results for our
customers. If you have a disability and need a workplace
accommodation or adjustment during the application and hiring
process, including support for the interview or onboarding process,
please visit
https://amazon.jobs/content/en/how-we-hire/accommodations for more
information. If the country/region you’re applying in isn’t listed,
please contact your Recruiting Partner. The base salary range for
this position is listed below. Your Amazon package will include
sign-on payments and restricted stock units (RSUs). Final
compensation will be determined based on factors including
experience, qualifications, and location. Amazon also offers
comprehensive benefits including health insurance (medical, dental,
vision, prescription, Basic Life & AD&D insurance and option
for Supplemental life plans, EAP, Mental Health Support, Medical
Advice Line, Flexible Spending Accounts, Adoption and Surrogacy
Reimbursement coverage), 401(k) matching, paid time off, and
parental leave. Learn more about our benefits at
https://amazon.jobs/en/benefits . USA, CA, Cupertino - 173,900.00 -
235,200.00 USD annually USA, TX, Austin - 151,200.00 - 204,600.00
USD annually USA, WA, Seattle - 151,200.00 - 204,600.00 USD
annually
Keywords: Amazon, Newark , Systems Development Eng (AWS Generative AI & ML Servers), AWS Hardware Engineering Accelerators, IT / Software / Systems , Cupertino, California