Role : Google Cloud Data Architect – IAM Data Modernization
Location : Dallas, TX (4 days onsite)
Visa: H1B H4 EAD L2 H4 EAD
Project/Program
Identity & Access Management (IAM) Data Modernization – migration of an on-premises SQL data warehouse to a target-state Data Lake on Google Cloud (GCP), enabling metrics & reporting, advanced analytics, and GenAI use cases (natural language querying, accelerated summarization, cross-domain trend analysis).
About Program/Project
The IAM Data Modernization project involves migrating an on-premises SQL data warehouse to a target state Data Lake in GCP cloud environment. Key highlights include:
This modernization establishes a single source of truth for enterprise-wide data-driven decision-making.
Required Skills
Data Lake Architecture & Storage
· Experience with Hadoop/HDFS architecture, distributed file systems, and data locality principles
Qualifications
Data Ingestion & Orchestration
· Experience building batch and streaming ingestion pipelines using GCP-native services
· Knowledge of Pub/Sub-based streaming architectures, event schema design, and versioning
· Strong understanding of incremental ingestion and CDC patterns, including idempotency and deduplication
· Hands-on experience with workflow orchestration tools (Cloud Composer / Airflow)
· Ability to design robust error handling, replay, and backfill mechanisms