Lead AI QA Engineer
About the role
Join our India Tech Hub – Be among the first hires!
Kobie, a 35-year veteran of the loyalty industry, a multi-year Forrester Leader, and a USA Top Workplace, is expanding its global footprint by establishing a Tech Hub in India. Kobie partners with global brands to build deep connections with their customers through personalized, data-driven loyalty experiences and has a mission of growing enterprise value through loyalty. The Tech Hub will serve as a Global Capabilities Center for a broad range of technology roles, and this is your chance to play a pivotal role in shaping our presence in India. Join us as we continue to lead in loyalty, delivering innovative customer experiences for some of the world’s most recognized brands while working alongside some of the best and brightest in loyalty.
About the team and what we will build together
We’re looking for a Lead AI QA Engineer with 6+ years of experience who thrives on designing test strategies and evaluation harnesses for production-grade, agentic AI systems and who also brings ETL experience. You have strong Python skills, hands-on experience testing LLM-powered features (prompt regression, tool/function-call validation, RAG correctness, and structured-output schema checks), and working knowledge of evaluation frameworks such as RAGAS, DeepEval, LangSmith or Langfuse. You are comfortable writing solid SQL, automating tests with PyTest, exercising APIs through Postman or REST clients, and shipping test pipelines using Git, Docker and CI tooling like Jenkins or GitHub Actions.
Kobie runs some of the largest loyalty programs in the world. We are building an internal agent platform on Dataiku that automates analyst workflows, surfaces insights from program data in Snowflake, and gives our teams an LLM-native way to work with complex loyalty logic. As a Lead AI QA Engineer on the India Tech Hub team, you will play a key role in protecting that platform: designing golden datasets, running LLM-as-judge and regression suites, and owning the quality bar for what goes to production. This is not a manual-only role: you will automate, instrument and monitor; build QA and automation strategies and roadmaps; and partner closely with our U.S. AI & Innovation team and cross-functional partners across Engineering, Data, AI and Product.
How you will make an impact
Design and build evaluation harnesses for agentic systems in Python: golden datasets, LLM-as-judge graders, multi-turn regression suites and trace-based assertions. Develop a framework to verify AI-generated output
Author automated test suites for prompts, tools, structured outputs (Pydantic / JSON schema), retrieval pipelines (drawing on ETL experience) and end-to-end agent workflows
Validate guardrails around tool execution: auth scoping, input/output validation, PII and prompt-injection protections, and hallucination mitigation
Wire evaluations into CI using Dataiku Evaluations, GitHub Actions or Jenkins so every change is graded against quality, safety and cost SLOs before it ships
Build observability into testing by instrumenting traces with LangSmith, Langfuse, MLflow or OpenTelemetry and triaging production drift back into the eval harness
Own quality end-to-end — define release criteria, run pre-prod and shadow tests, and partner with engineering to root-cause and fix regressions quickly
Partner with data engineers on Snowflake-backed retrieval testing patterns (Cortex Analyst and Cortex Search Services) and with platform teams on observability, security and cost
Help shape internal QA standards for AI & Data engineering as the stack evolves, contributing to design reviews and sharing knowledge across the India and U.S. teams
Participate in a collaborative DevOps environment, working closely with developers, AI engineers, Data Engineers, DBAs and product partners across environments
In your first 90 days
By the end of your first 90 days, you will have stood up at least one production-grade evaluation harness (golden dataset, LLM-as-judge graders and regression suite) wired into CI for an internal agent. You will have automated trace-based assertions running against staging traffic, a quality scorecard for at least one shipped agent, and a clear opinion about what our next testing investment should be.
What you need to be successful
3+ years of professional QA / SDET experience, with production experience automating tests for backend services or data pipelines
1+ years of hands-on experience testing LLM or AI features in production: prompt regression, tool / function-call validation, structured outputs and RAG correctness
Working knowledge of evaluation frameworks such as RAGAS, DeepEval, LangSmith, Langfuse or comparable LLM-as-judge tooling
Strong Python and PyTest skills; solid SQL skills and comfort with at least one cloud platform (AWS, Azure or GCP)
Fluency with Git, Docker, REST APIs and at least one CI tool (GitHub Actions, Jenkins, GitLab CI or CircleCI)
Solid understanding of data security and responsible AI practices, particularly in PCI-compliant or regulated environments
Proven ability to work independently and within a team, managing priorities across concurrent projects and time zones
Strong written and verbal communication skills; able to work effectively with both technical and non-technical stakeholders
A bachelor’s degree is not required — equivalent practical experience (including bootcamps, self-taught work, career changes or non-CS technical degrees) counts
Bonus Skills:
Hands-on experience with Dataiku DSS (Python / SQL recipes, scenarios, code environments, the dataiku and dataikuapi clients) or Dataiku Evaluations
Experience with Dataiku LLM Mesh, Knowledge Banks, Prompt Studio, or Visual / Code Agents
Experience with Snowflake, Snowpark, or Snowflake Cortex (Search, Analyst, Agents)
Experience with red-teaming, prompt-injection testing or adversarial test generation for LLMs
Familiarity with multi-agent patterns: supervisor / router, subagent / handoff, reflection, human-in-the-loop
Experience with performance and load testing tools such as Locust, JMeter or k6
ISTQB, AI Testing or comparable QA certification
Experience in loyalty, martech, adtech or a comparable data-rich B2B domain
About Kobie
Named a Top Workplace in the USA and Top Remote Workplace, Kobie is where the best minds in loyalty come together, driven by passion and innovation. We’re always looking for talented individuals ready to join a collaborative, growth-focused culture.
As a trusted partner to some of the world’s most recognized brands, we are loyalty leaders, helping brands build lasting emotional connections with their consumers. We do this with a strategy-led technology approach that uncovers the truth behind what drives consumers on an emotional level. As we launch our India Tech Hub, we are excited to bring our award-winning culture to a new region - creating an environment where collaboration, flexibility, and career growth come together to build something truly special.
We are proud to be the only loyalty provider to be externally recognized for its culture. We believe people thrive when they feel valued, supported, and empowered to be their authentic selves. Our commitment to diversity, equity, and inclusion ensures every teammate has a voice and the opportunity to be heard.
Giving back is in our DNA at Kobie. Through an annual fundraiser, charitable partnerships, and volunteer opportunities, we encourage our teammates to make a difference in their communities.
To support our teammates beyond just their careers, we offer highly competitive benefits, comprehensive health coverage, and well-being perks that support our teammates and their dependents. We understand the importance of time for life outside of work - recognizing public holidays, offering flexible time off, and prioritizing work-life balance.
As we expand into India, our new teammates will be fully integrated with our U.S. teams, working on global projects and gaining exposure to top industry leaders. With continued growth, we will establish a physical office in Bengaluru, India, giving teammates a space for collaboration and fostering connection.
Now is the perfect time to join Kobie. Be part of something big and help shape the future of our global capabilities center, the Kobie India Tech Hub.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.