Back to jobsJob overview
About the role
Data Engineer I at ADCI - BLR 14 SEZ
Required Skills
pythonsparkawssqletldata pipelinesragdata modelingapi development
About the Role
The Data Engineer I role supports a GenAI-powered insights assistant initiative by building and scaling ingestion and embedding pipelines for unstructured knowledge bases. This position involves ensuring the retrieval-augmented generation system accesses fresh, relevant document embeddings to enhance AI-driven insights and user query satisfaction.Key Responsibilities
- Build batch and streaming data pipelines using Spark and AWS streaming services
- Implement automated checks to ensure data consistency across different data types
- Define and maintain data contracts with source teams to keep schemas consistent
- Develop cross-domain metadata services linking structured and unstructured data catalogs
- Create APIs and event-driven workflows integrating AI insights with business tools
Required Skills & Qualifications
Must Have:
- 1+ years of data engineering experience
- Experience with data modeling, warehousing and building ETL pipelines
- Experience with one or more query language (e.g., SQL, PL/SQL, DDL, MDX, HiveQL, SparkSQL, Scala)
- Experience with one or more scripting language (e.g., Python, KornShell)
Nice to Have:
- Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
- Experience with any ETL tool like, Informatica, ODI, SSIS, BODI, Datastage, etc.
- Familiarity with RAG (Retrieval-Augmented Generation) principles
- AWS experience: Lambda, S3, SageMaker, Bedrock Knowledge Bases