[Remote] Senior Data Engineer (GCP)
Note: This is a remote position open to candidates in the USA. A reputed company is seeking a Senior Data Engineer with expertise in Google Cloud Platform (GCP) to lead the development of its data ecosystem. The role involves designing and deploying data architectures, ensuring data security, and providing technical leadership across teams.
Responsibilities
- Architectural Strategy & System Design
  - Enterprise Design: Conceptualize and implement end-to-end data architectures utilizing GCP’s Modern Data Stack (BigQuery, Dataflow, Pub/Sub).
  - Scalable Data Modeling: Lead the development of high-performance data models (Star, Snowflake, Data Vault) optimized for multi-petabyte scale and high-concurrency analytics.
  - Hybrid & Multi-Cloud Strategy: Provide technical leadership on data integration strategies spanning GCP, on-premise systems, and third-party SaaS environments.
- Advanced Engineering & Pipeline Automation
  - Distributed Processing: Engineer highly resilient, low-latency streaming and batch pipelines using Apache Beam (Dataflow) and Cloud Composer (Airflow).
  - Software Engineering Excellence: Develop reusable Python libraries and frameworks to standardize data ingestion, logging, and error handling across the engineering team.
  - Infrastructure as Code (IaC): Drive operational maturity by managing cloud resources exclusively through Terraform, ensuring robust versioning and consistency across environments.
- Data Governance, Security & Performance
  - System Optimization: Conduct deep-dive performance tuning of BigQuery environments, implementing partitioning, clustering, and slot management to optimize ROI.
  - Security & Compliance: Architect data security protocols including VPC Service Controls, IAM Least Privilege, and data masking/encryption to meet global compliance standards (GDPR, SOC2).
  - Observability: Establish comprehensive monitoring and alerting frameworks for data health, ensuring high availability and meeting stringent Service Level Objectives (SLOs).
- Technical Leadership & Collaboration
  - Strategic Mentorship: Serve as a mentor to mid-level and junior engineers, conducting rigorous code reviews and promoting best practices in Data Ops.
  - Stakeholder Alignment: Act as a primary technical liaison between Data Science, Business Intelligence, and Executive leadership to translate business goals into technical roadmaps.
Skills
- 8–10 years of professional experience in data engineering
- Mastery of distributed computing
- Advanced Python development skills
- Expert-level SQL optimization skills
- Experience with GCP's Modern Data Stack (BigQuery, Dataflow, Pub/Sub)
- Ability to conceptualize and implement end-to-end data architectures
- Experience in developing high-performance data models (Star, Snowflake, Data Vault)
- Technical leadership on data integration strategies spanning GCP, on-premise systems, and third-party SaaS environments
- Experience in engineering resilient, low-latency streaming and batch pipelines using Apache Beam (Dataflow) and Cloud Composer (Airflow)
- Ability to develop reusable Python libraries and frameworks for data ingestion, logging, and error handling
- Experience managing cloud resources through Terraform
- Conducting performance tuning of BigQuery environments
- Implementing data security protocols including VPC Service Controls, IAM Least Privilege, and data masking/encryption
- Establishing monitoring and alerting frameworks for data health
- Mentoring mid-level and junior engineers
- Conducting code reviews and promoting best practices in Data Ops
- Acting as a primary technical liaison between Data Science, Business Intelligence, and Executive leadership
- Bachelor's or Master's degree in Information Technology, Computer Science, or a relevant field
Company Overview