Role - Data Engineer (Python / PySpark / GCP)
Location - Dallas, TX / Hartford, CT / Woonsocket, RI (Onsite)
Experience needed - 6+ years
Job Description
This role requires strong Python and PySpark expertise, experience with big data tools, and the ability to collaborate with cross-functional teams to deliver actionable insights and high-quality data solutions.
Relevant Experience
• 5 years of experience in data engineering
Technical & Functional Skills
• Strong skills in Python programming
• Extensive experience with PySpark
• Hands-on experience with data pipeline and workflow management tools, including:
o GCP Dataproc
o Cloud Composer
o Google Cloud Storage (GCS)
• Strong knowledge of SQL, including advanced window functions
• Experience working with BigQuery (expertise not required)
• Ability to design and implement data engineering solutions, including:
o Data profiling
o Efficient data ingestion
o Building semantic data layers
Roles & Responsibilities
- Support analytics tools and products that leverage data pipelines to provide insights into customer acquisition, operational efficiency, and other key business metrics.
- Support, maintain, and monitor the infrastructure required for optimal data extraction, transformation, and loading (ETL/ELT) from a variety of data sources using SQL and GCP big data technologies.
- Collaborate with data and analytics experts to enhance the overall functionality and performance of enterprise data systems.
- Work with stakeholders, including Executive, Product, Data, and Design teams, to address data-related technical issues and support ongoing infrastructure needs.
- Develop, support, and monitor data pipelines, ensuring reliability and performance.
- Partner closely with clients to understand needs, collect relevant data, and deliver high-value data products.
- Quickly derive valuable insights from datasets, providing meaningful input to business decisions.
- Communicate complex findings in clear, accessible language and create visualizations that are easy for business and operational audiences to understand.