Data Engineer, OIS/CXI Analytics
Analytics
Artificial Intelligence
AWS
Big Data
Business Intelligence
Cloud
Cloud Architecture
Data Analytics
Data Engineer
Data Governance
Data Integration
Data Pipeline
Data Platform
Data Processing
Data Warehouse
DevOps
ETL
Integration
Kinesis
Machine Learning
Ml Ops
Production Analytics
Reporting and Analytics
Job Description
Within the OIS/CXI Analytics group, this Data Engineer role centers on building scalable data pipelines and ML-ready data infrastructure to empower AI-driven operational insights across Amazon’s fulfillment and operations network. The position emphasizes production-grade ETL/ELT development, feature engineering workflows, data governance, GenAI-enabled reporting, and close collaboration with ML engineers, data scientists, and stakeholders to deliver reliable, data-driven decisions.
Details
- Location: Nashville, TN (onsite)
- Salary: USD 125,500 - 169,800 per year
- Minimum experience: 3 years
- Education: Bachelor's degree or higher
Responsibilities
- Architect and maintain production grade ETL/ELT pipelines and large-scale data infrastructure to support OTS operational intelligence
- Develop feature engineering workflows and ML-ready data pipelines to enable data science experimentation and production model serving
- Contribute to data governance and quality standards across analytical and ML data products
- Assist in implementing GenAI solutions for automated reporting, diagnostics, predictive and prescriptive analytics
- Construct and manage semantic layers and dashboard data models that inform global operations decisions
- Collaborate with Program Managers, BI teams, ML engineers, data scientists, and operational stakeholders to prioritize work aligned with OTS goals
- Adhere to and contribute to data engineering best practices, including code reviews, testing, monitoring, and documentation
Requirements
- 3+ years of data engineering experience
- 3+ years designing and operating large-scale BI data structures with data modeling experience
- Experience in data modeling, data warehousing, and building ETL pipelines
- Hands-on with AWS technologies such as Redshift, S3, AWS Glue, EMR, Kinesis, Firehose, Lambda, and IAM roles/permissions
- Background in data warehouse architectures, data modeling, infrastructure components, ETL/ELT and reporting/analytic tools, data structures, and practical SQL coding
- Bachelor's degree or higher in computer science, engineering, or related fields, or equivalent experience building and maintaining data flows
- Proficiency in Python and SQL; experience with PySpark or Apache Spark
- Experience with infrastructure-as-code (CDK, CloudFormation) and CI/CD pipelines for data and ML systems
- Experience with data modeling and designing relational and non-relational databases
Technologies
- Python
- SQL
- PySpark
- Apache Spark
- Redshift
- S3
- AWS Glue
- EMR
- Kinesis
- FireHose
- Lambda
- IAM
- CDK
- CloudFormation
Benefits
- Medical, Dental, and Vision Coverage
- Maternity and Parental Leave Options
- Paid Time Off (PTO)
- 401(k) Plan
Preferred Qualifications
- Experience with non-relational databases and data stores such as object storage, document or key-value stores, graph databases, and column-family databases
- Master's degree or higher in computer science, engineering, analytics, mathematics, statistics, IT or equivalent