PradeepIT Consulting Services Pvt Ltd
Databricks (Remote)
Job Description
Roles & responsibilities
- Developing Modern Data Warehouse solutions using Databricks and AWS/ Azure Stack
- Ability to provide solutions that are forward-thinking in data engineering and analytics space
- Collaborating with DW/BI leads to understanding new ETL pipeline development requirements.
- Triage issues to find gaps in existing pipelines and fix the issues
- Work with businesses to understand the need in the reporting layer and develop a data model to fulfill
- reporting needs
- Help joiner team members to resolve issues and technical challenges.
- Drive technical discussion with client architects and team members
- Orchestrate the data pipelines in the scheduler via Airflow
- Qualification & experience
- Bachelor’s and/or master’s degree in computer science or equivalent experience.
- Must have a total of 6+ yrs. of IT experience and 3+ years’ experience in Data warehouse/ETL projects.
- Deep understanding of Star and Snowflake dimensional modeling.
- Strong knowledge of Data Management principles
- Good understanding of Databricks Data & AI platform and Databricks Delta Lake Architecture
- Should have hands-on experience in SQL, Python, and Spark (PySpark)
- Candidate must have experience in AWS/ Azure stack
- Desirable to have ETL with batch and streaming (Kinesis).
- Experience in building ETL / data warehouse transformation processes
- Experience with Apache Kafka for use with streaming data / event-based data
- Experience with other Open-Source big data products Hadoop (incl. Hive, Pig, Impala)
- Experience with Open Source non-relational / NoSQL data repositories (incl. MongoDB,
- Cassandra, Neo4J)
- Experience working with structured and unstructured data including imaging & geospatial data.
- Experience working in a Dev/Ops environment with tools such as Terraform, CircleCI, and GIT.
- Proficiency in RDBMS, complex SQL, PL/SQL, Unix Shell Scripting, performance tuning, an troubleshooting.
- Databricks Certified Data Engineer Associate/Professional Certification (Desirable).
- Comfortable working in a dynamic, fast-paced, innovative environment with several ongoing
- concurrent projects
- Should have experience working in Agile methodology.
- Strong verbal and written communication skills.
- Strong analytical and problem-solving skills with a high attention to detail. Mandatory Skills:
- Python/ PySpark / Spark with Azure/ AWS Databricks