Ford Motor Company
Data Engineer
Job Description
This individual will be responsible for creating products to host Supply Chain Analytics algorithms. We are looking for someone with full-stack experience, even if their specialty is in one area. This person should have 5-7+ years of experience with software engineering and testing, experience working in an Agile environment, and experience with Rally. This person will interact with Ford leadership, needs strong written and oral communication skills, and should be comfortable taking initiative rather than waiting to be told what to do.
- Develop EL/ELT/ETL pipelines to make data from disparate batch and streaming sources available in the BigQuery analytical data store for the Business Intelligence and Analytics teams.
- Work with on-prem data sources (Hadoop, SQL Server), understand the data model and the business rules behind the data, and build data pipelines (with GCP) for one or more Ford verticals. This data will be landed in GCP BigQuery.
- Build cloud-native services and APIs to support and expose data-driven solutions.
- Partner closely with our data scientists to ensure the right data is made available in a timely manner to deliver compelling and insightful solutions.
- Design, build and launch shared data services to be leveraged by the internal and external partner developer community.
- Build out scalable data pipelines and choose the right tools for the job. Manage, optimize, and monitor data pipelines.
- Provide extensive technical and strategic advice and guidance to key stakeholders on data transformation efforts. Understand how data is useful to the enterprise.
Required Skills:
- Bachelor’s degree in Computer Science, Computer Engineering, Data Science, Analytics, or a related field, or an equivalent combination of education and experience.
- 3+ years of experience with SQL, Python & Java.
- 4+ years of experience with GCP cloud services (Dataflow, BigQuery & Pub/Sub).
- 3+ years of experience building out data pipelines from scratch in a highly distributed and fault-tolerant manner.
Desired Skills:
- Experience with GCP cloud services including BigQuery, Cloud Composer, Dataflow, Cloud SQL, GCS, Cloud Functions, and Pub/Sub.
- 1+ years of experience with Hive, Spark, Scala, and JavaScript.
- Proven track record of building applications in a data-focused role (Cloud and Traditional Data Warehouse).
- Inquisitive, proactive, and interested in learning new tools and techniques.
- Familiarity with big data and machine learning tools and platforms. Comfortable with open-source technologies including Apache Spark, Hadoop, and Kafka.
- Strong oral, written and interpersonal communication skills.
- Comfortable working in a dynamic environment where problems are not always well-defined.