Principal Software Engineer, AI-Powered Advertising Agents
Company: PubMatic, Inc.
Location: Redwood City
Posted on: May 18, 2025
Job Description:
About the role
PubMatic is looking for engineers with expertise in
Generative AI and AI agent development. You will be responsible for
building and optimizing advanced AI agents that leverage the latest
technologies in Retrieval-Augmented Generation (RAG), vector
databases, and large language models (LLMs). You will work on
developing state-of-the-art solutions that enhance Generative AI
capabilities and enable our platform to handle complex information
retrieval, contextual generation, and adaptive interactions.

What You'll Do
- Be the decision maker on the right set of tools and technologies
to solve specific problems.
- Guide and mentor team members in using generative AI tools, and
help teams build agents.
- Provide technical leadership and mentorship to engineering
teams while collaborating with architects, product managers, and UX
designers to create innovative AI solutions that address complex
customer challenges.
- Lead the design, development, and deployment of AI-driven
features. Drive end-to-end ownership, from feasibility analysis and
design specifications to execution and release, while ensuring quick
iterations based on customer feedback in a fast-paced Agile
environment.
- Spearhead technical design meetings and produce detailed design
documents that outline scalable, secure, and robust AI
architectures. Ensure that the solutions are aligned with long-term
product strategy and technical roadmaps.
- Implement and optimize LLMs for specific use cases, including
fine-tuning models, deploying pre-trained models, and evaluating
their performance.
- Develop AI agents powered by RAG systems, integrating external
knowledge sources to improve the accuracy and relevance of
generated content.
- Design, implement, and optimize vector databases (e.g., FAISS,
Pinecone, Weaviate) for efficient and scalable vector search, and
work on various vector indexing algorithms.
- Create sophisticated prompts and fine-tune them to improve the
performance of LLMs in generating precise and contextually relevant
responses.
- Utilize evaluation frameworks and metrics (e.g., Evals) to
assess and improve the performance of generative models and AI
systems.
- Work with data scientists, engineers, and product teams to
integrate AI-driven capabilities into customer-facing products and
internal tools.
- Stay up to date with the latest research and trends in LLMs,
RAG, and generative AI technologies to drive innovation in the
company's offerings.
- Continuously monitor and optimize models to improve their
performance, scalability, and cost efficiency.

We'd Love for You to Have
- Strong understanding of large language models (GPT, BERT, T5,
etc.) and their underlying principles, including transformer
architecture and attention mechanisms.
- Proven experience building AI agents with Retrieval-Augmented
Generation to enhance model performance using external data sources
(documents, databases).
- In-depth knowledge of vector databases, vector indexing
algorithms, and experience with technologies like FAISS, Pinecone,
Weaviate, or Milvus.
- Ability to craft complex prompts to guide the output of LLMs
for specific use cases, improving the model's understanding of the
task and the contextual relevance of its output.
- Familiarity with Evals and other performance evaluation tools
for measuring model quality, relevance, and efficiency.
- Proficiency in Python and experience with machine learning
libraries such as TensorFlow, PyTorch, and Hugging Face
Transformers.
- Experience with data preprocessing, vectorization, and handling
large-scale datasets.
- Ability to present complex technical ideas and results to both
technical and non-technical stakeholders.
- Curiosity to learn new things and stay up to date with market
trends in generative AI technology.

Nice-to-Have
- Experience in building AI agents using graph-based
architectures, including knowledge graph embeddings and graph
neural networks (GNNs).
- Experience with training small base models using custom data,
including data collection, pre-processing, and fine-tuning models
to specific domains or tasks.
- Familiarity with deploying AI models on cloud platforms (AWS,
GCP, Azure) and containerization technologies (Docker,
Kubernetes).
- Publications or contributions to research in AI, LLMs, or
related fields.
- Proven record of building enterprise-scale generative AI
applications, with specific emphasis on accuracy and cost.

Qualifications
- Should have a bachelor's degree in engineering (CS / IT) or an
equivalent degree from a well-known institution or university.

Additional Information
Return to Office: PubMatic
employees around the globe have returned to our offices via a
hybrid work schedule (3 days "in office" and 2 days "working
remotely") that is intended to maximize collaboration, innovation,
and productivity among teams and across functions.

Benefits: Our
benefits package includes the best of what leading organizations
provide, such as paternity/maternity leave, healthcare insurance,
and broadband reimbursement. In addition, when we're back in the
office, we all benefit from a kitchen loaded with healthy snacks and
drinks, catered lunches, and much more!

Diversity and Inclusion: PubMatic
is proud to be an equal opportunity employer; we don't just value
diversity, we promote and celebrate it. We do not discriminate on
the basis of race, religion, color, national origin, gender, sexual
orientation, age, marital status, veteran status, or disability
status.

About PubMatic
PubMatic is one of the world's leading scaled
digital advertising platforms, offering more transparent
advertising solutions to publishers, media buyers, commerce
companies and data owners, allowing them to harness the power and
potential of the open internet to drive better business
outcomes.

Founded in 2006 with the vision that data-driven
decisioning would be the future of digital advertising, we enable
content creators to run a more profitable advertising business,
which in turn allows them to invest back into the multi-screen and
multi-format content that consumers demand.

Compensation Disclosure
In accordance with applicable law, the salary range provided below
is PubMatic's reasonable estimate of the total
compensation for this role. New hires and current team members are
typically compensated toward the middle of our pay range. The
actual amount may vary, based on non-discriminatory factors such as
location, experience, knowledge, skills, and abilities. In addition
to salary, PubMatic also offers a bonus, restricted stock units, and
a competitive benefits package.