Back

Data Engineer (Python & PySpark)

Worldwide Salaried Open

Key Responsibilities

Pipeline Development: Design, develop, and maintain end-to-end ETL/ELT pipelines using Python and PySpark. Big Data Processing: Build large-scale data processing frameworks to handle structured and unstructured data, ensuring high performance and reliability. Cloud Infrastructure: Architect and manage data solutions within the GCP ecosystem, focusing on cost-efficiency and security. Data Modeling: Design and implement robust data warehouse models (Star/Snowflake schemas) and data lake architectures. Optimization: Identify, design, and implement internal process improvements, such as automating manual processes and optimizing data delivery for greater scalability. Collaboration: Work closely with stakeholders to understand data requirements and translate them into technical specifications. Technical Qualifications Core Programming: Strong proficiency in Python, including experience with libraries like Pandas, NumPy, and logging frameworks. Big Data: 3+ years of hands-on experience with Apache Spark (PySpark) for distributed data processing. GCP Ecosystem: Practical experience with Google Cloud services, specifically: BigQuery (Optimization, Partitioning, Clustering). Cloud DataProc or Dataflow. Cloud Storage (GCS) and Cloud Functions. Cloud Composer (Apache Airflow) for orchestration. Data Warehousing: Solid understanding of relational databases and SQL (PostgreSQL, MySQL) as well as NoSQL environments. DevOps & Tools: Experience with Git, Docker, and CI/CD pipelines. Familiarity with Terraform or other IaC tools is a significant plus. Apply To This Job

More jobs

Formateur Freelance - CAP Plomberie

Worldwide Salaried

Senior account executive

Worldwide Salaried

General Manager – B2C SaaS (Fully Remote)

Worldwide Salaried

Strategic Account Manager (m/w/d) | 4.000€ Fixum + Provision | 110K OTE | 100% Remote

Worldwide Salaried

Formateur Freelance - CAP Maintenance des Véhicules option A - Véhicules légers

Worldwide Salaried

Consultores BMC Helix (ITSM / Digital Workplace) - 100% Remoto

Worldwide Salaried

Brand Strategy Manager

Worldwide Salaried

Junior Visual Designer (m/f/x)

Worldwide Salaried

Java Full Stack Developer (Outbound/Selfservice)

Worldwide Salaried

WE ARE HIRING - Insegnante Per Ripetizioni di Greco

Worldwide Salaried

Geospatial Database Administrator

Worldwide Salaried

Experienced Customer Support Representative for arenaflex Online Course Platform

Worldwide Salaried

Registered Nurse Inpatient Unit PRN

Worldwide Salaried

Experienced Online Part-Time Disney Customer Support Representative – Delivering Magical Experiences to Disney Enthusiasts Worldwide

Worldwide Salaried

Associate Planner, Media - Ptarmigan

Worldwide Salaried

QA Engineer / Test Specialist (m/f/d)

Worldwide Salaried

Technical Customer Success Associate/Manager – Healthcare IT Specialist for Cloud‑Based MRI Reconstruction (SwiftMR)

Worldwide Salaried

Experienced Data Entry Clerk – Remote Opportunity with arenaflex

Worldwide Salaried

Senior Software Engineer, Core Experiences - Oxford, United Kingdom

Worldwide Salaried

Molecular Biologist | Upto $60/hr Remote

Worldwide Salaried