- Jobs
- AHEAD
- GenAI Data ETL Engineer
GenAI Data ETL Engineer
Agentic Frameworks
About the Role
AHEAD builds platforms for digital business. By weaving together advances in cloud infrastructure, automation and analytics, and software delivery, we help enterprises deliver on the promise of digital transformation. At AHEAD, we prioritize creating a culture of belonging, where all perspectives and voices are represented, valued, respected, and heard. We create spaces to empower everyone to speak up, make change, and drive the culture at AHEAD. We are an equal opportunity employer, and do not discriminate based on an individual's race, national origin, color, gender, gender identity, gender expression, sexual orientation, religion, age, disability, marital status, or any other protected characteristic under applicable law, whether actual or perceived. We embrace all candidates that will contribute to the diversification and enrichment of ideas and perspectives at AHEAD. We are seeking a GenAI Data Engineer – Data Integration & Retrieval to design, build, and operate the data pipelines that power our LLM‑based applications, agents, and analytics. This role sits at the intersection of data engineering and generative AI, with a focus on turning messy, distributed enterprise data into high‑quality context for retrieval‑augmented generation (RAG), copilots, and intelligent automation. You will partner closely with the Platform and Use Cases Teams, GenAI/ML engineers, and business stakeholders to deliver robust, observable, and future‑proof data flows that keep us ahead of where the industry is going. Education Minimum Required: Bachelor’s degree in Computer Science, Information Systems, or similar Preferred Skills Experience with LLM‑centric data patterns, such as retrieval‑augmented generation (RAG), semantic search, or document intelligence. Hands‑on experience with vector databases or search technologies (e.g., Pinecone, Weaviate, pgvector, OpenSearch, Elasticsearch, Vespa). Experience with workflow orchestration tools (e.g., Apache Airflow, Prefect, Dagster, Azure Data Factory, AWS Glue workflows). Exposure to message‑based or streaming integrations (e.g., Kafka, Kinesis, Pub/Sub, EventBridge) for near real‑time data and event feeds into GenAI systems. Experience in data quality and observability (e.g., Great Expectations, Monte Carlo, Soda, or custom checks/alerts). Knowledge of at least one cloud platform (AWS, Azure, GCP) and its data/AI services (e.g., object storage, serverless compute, managed warehouses, managed LLMs or embeddings). Familiarity with security and compliance concepts: data classification, encryption, access controls, secrets management, and safe handling of PII/regulated data. Why AHEAD: Through our daily work and internal groups like Moving Women AHEAD and RISE AHEAD, we value and benefit from diversity of people, ideas, experience, and everything in between. We fuel growth by stacking our office with top-notch technologies in a multi-million-dollar lab, by encouraging cross department training and development, sponsoring certifications and credentials for continued learning. India Employment Benefits include: Comprehensive health insurance coverage for employees, with options to extend coverage to dependents Paid time off and company holidays , along with additional leave benefits as per policy Flexible work arrangements , supporting work-life balance Learning and development opportunities to support continuous growth and upskilling Employee wellness initiatives and programs focused on physical and mental well-being Retirement and statutory benefits in line with India regulations Inclusive and people-first culture , with a strong focus on collaboration and ownership