All roles

GCP Data Engineer + Java

Remote · USA Full-time New today

KANINI is seeking a highly skilled Data Engineer with deep expertise in Google Cloud Platform (GCP) and modern data architecture. The ideal candidate will have hands-on experience designing scalable data pipelines, implementing Medallion Architecture, and building robust enterprise-grade data solutions. This role requires strong technical proficiency in BigQuery, PySpark, Dataflow, and Airflow, along with a solid understanding of cloud data governance, performance optimization, and CI/CD practices.

Key Responsibilities

  • Design, develop, and maintain scalable batch and real-time data pipelines on GCP
  • Implement and manage Medallion Architecture (Bronze, Silver, Gold layers) for data processing
  • Build high-performance data transformations using Python and PySpark
  • Develop and optimize complex SQL queries for analytical workloads
  • Work extensively with BigQuery for large-scale data processing and performance tuning
  • Develop and deploy pipelines using Cloud Dataflow
  • Orchestrate workflows using Cloud Composer (Apache Airflow)
  • Manage data storage and lifecycle using Google Cloud Storage (GCS)
  • Implement version control and CI/CD pipelines using Git-based tools
  • Ensure data security, governance, and access control using GCP IAM
  • Optimize data solutions for performance, scalability, reliability, and cost-efficiency

Required Skills & Experience

  • Strong hands-on experience with Google Cloud Platform (GCP)
  • Expertise in BigQuery (partitioning, clustering, query optimization)
  • Proven experience implementing Medallion Data Architecture
  • Strong programming skills in Python and PySpark
  • Hands-on exposure on Java
  • Advanced proficiency in SQL (complex joins, window functions, performance tuning)
  • Hands-on experience with Cloud Dataflow
  • Experience with Cloud Composer (Airflow) for orchestration
  • Experience working with Google Cloud Storage (GCS)
  • Knowledge of version control systems (Git) and CI/CD practices
  • Strong understanding of GCP IAM and cloud security best practices

Preferred Qualifications

  • Experience working with large-scale enterprise data platforms
  • Knowledge of data warehousing and data lake concepts
  • Familiarity with real-time streaming frameworks
  • Experience in data governance and data quality frameworks
  • Exposure to Agile/Scrum methodologies

Apply To This Job

Related roles

Principal Site Reliability Engineer - ARINCDirect (Remote)

Remote · USA Full-time

Site Reliability Engineer (USA Only - 100% Remote)

Remote · USA Full-time

Site Reliability Engineer (SRE) - Remote

Remote · USA Full-time

Site Reliability Engineer-Remote (PST hours)

Remote · USA Full-time

Site Reliability Engineer II ( Remote )

Remote · USA Full-time

Senior Site Reliability Engineer – Remote US

Remote · USA Full-time

Site Reliability Engineer/ Chaos Engineer Remote to start

Remote · USA Full-time

Site Reliability Engineer (SRE) in Austin, TX (Remote)

Remote · USA Full-time

SRE “ Site Reliability Engineer”

Remote · USA Full-time

Senior Site Reliability Engineer, APAC

Remote · USA Full-time

Part-Time Remote Data Entry Specialist – Flexible Home‑Based Role for U.S. Candidates

Remote · USA Full-time

Staff Accountant I

Remote · USA Full-time

Head of MCS IO Finance

Remote · USA Full-time

Experienced Part-Time Remote Data Entry Clerk – arenaflex

Remote · USA Full-time

Experienced Customer Service Representative – Market Research and Data Entry Specialist (Part-Time Remote Work Opportunity)

Remote · USA Full-time

Experienced Customer Service Representative / Customer Advocate (Remote) - Join arenaflex's Dynamic Team

Remote · USA Full-time

Mortgage Loan Officer Assistant - Branch Support Vienna, VA (Remote)

Remote · USA Full-time

Medical Director, Behavioral Health - SIU

Remote · USA Full-time

Account Executive

Remote · USA Full-time

Experienced Data Entry Clerk (Remote) – Join arenaflex's Team and Help Canadians Get Back on Track

Remote · USA Full-time