Location: Dallas Metroplex (Hybrid or On-Site)
Employment Type: Full-time | Entry Level
Team: Data Engineering | AI & ML Support
About the Role
We are looking for a junior data engineer to join our fast-growing team supporting generative AI and machine learning initiatives across enterprise environments. You’ll work closely with senior data engineers, AI specialists, and data scientists to build and maintain data pipelines using Azure Data Services and Databricks. This role is ideal for someone passionate about data, cloud technologies, and eager to grow their career in the AI/ML space.
Key Responsibilities
-
Develop and maintain scalable data pipelines using Azure Data Factory, Databricks, and PySpark
-
Support data ingestion, transformation, and integration for GenAI and ML models
-
Collaborate with data scientists and ML engineers to prepare high-quality datasets
-
Assist in optimizing data workflows across ADLS, SQL databases, and Delta Lake
-
Ensure data quality, governance, and compliance practices are followed
-
Monitor and troubleshoot production data jobs and pipelines
-
Contribute to CI/CD practices for deploying data workflows
Preferred Skills & Qualifications
-
Bachelor’s degree in Computer Science, Data Engineering, Information Systems, or related field
-
0–2 years of experience in data engineering or cloud-based data development
-
Familiarity with Azure services like ADF, ADLS Gen2, Azure SQL, and Azure Functions
-
Basic working knowledge of Databricks, Spark (especially PySpark), and SQL
-
Interest or coursework in AI/ML, especially generative AI and LLMs
-
Understanding of version control (e.g., Git) and DevOps concepts
Nice to Have (Not Required)
-
Exposure to Unity Catalog, Delta Live Tables, and MLflow
-
Hands-on experience with data visualization or BI tools (Power BI, Tableau)
-
Knowledge of data security and governance best practices
Why Join Us
-
Be part of transformative AI and data projects impacting real business outcomes
-
Get mentored by senior engineers and AI professionals
-
Grow your technical skills in an Azure and Databricks-focused environment
-
Contribute to GenAI solutions that are shaping the future