GCP Data Engineer in Fusemachines

FULL_TIME

  Remote | Senior | Full time | Programming

Gross salary $4000 - 5000 USD/month

14 applications
Replies between 0 and 10 days
Last checked yesterday
Apply now Quick apply
Requires applying in English

Fusemachines is a leading AI strategy, talent, and education services, provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, the United States, Canada, and the Dominican Republic and more than 250 full-time employees) Fusemachines seeks to bring its global expertise in AI to transform companies around the world.

Funciones del cargo

  • Build, maintain and optimize data pipelines for our Enterprise Data Platform using Google Cloud Platform technologies.
  • Ingest new data sources from initial discovery & data architecture, through ETL authoring, operationalizing using Dataflow, Airflow (Cloud Composer) DAGs, and post launch lifecycle.
  • Research and test new big-data technologies and tools.
  • Advanced SQL queries, and modeling.
  • Help drive the roadmap of the Data Engineering team and Enterprise Data Platform.
  • Engage vendors with required features to meet business needs.
  • Leverage the full value of our vendor integrations and APIs.
  • Optimize BigQuery and ETL resources to decrease spend & increase performance.
  • Help guide team of engineers on best practices, standards, and toolsets to author ETL
  • Engineering pipelines

Requerimientos del cargo

  • Bachelor’s degree in data science, computer science or similar majors
  • 5+ years of experience in Data Engineering
  • 5+ years of experience in at least one OOP language, preferably Python
  • Deep SQL knowledge / experience
  • Experience in complex pipeline task management (i.e., Airflow and Beam)
  • Experience with Github

Opcionales

  • Google Cloud Platform tools or equivalent platform experience using tools s/a BigQuery, Dataflow / Apache Beam, Cloud Composer / Airflow, Apache Spark, Pub/Sub
  • Experience in Apache Beam
  • Experience with Scala
  • Experience with Scalr/Terraform
  • Experience with CI/CD

Conditions

Fully remote You can work from anywhere in the world.

Remote work policy

Fully remote

Candidates can reside anywhere in the world.

About Fusemachines

Fusemachines is a leading AI strategy, talent, and education services, provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal — Fusemachines's full profile