About the role
Operate and improve healthcare data infrastructure.
- Operate and improve the infrastructure that moves large-scale healthcare datasets between partners, cloud platforms, and customers.
Key Responsibilities
- Execute and monitor large-scale data transfers across AWS S3, Google Cloud Storage, Azure Blob, Snowflake, and customer environments.
- Use Python and SQL to join datasets, add derived columns, clean data, and validate CSV files, Parquet files, and database tables.
- Leverage Protege's Dagster-based platform to orchestrate data processing and delivery.
Requirements
- Strong hands-on experience with data pipelines, both orchestrated and manual, in real production environments.
- Fluency with command-line tooling on Linux or macOS and strong scripting ability in Python, SQL, and Bash/shell.
- Experience working with cloud storage systems and large-scale cross-cloud data movement.
Tech stack
AWS, Google Cloud, Azure, Snowflake, Python, SQL, Bash, Dagster
Match insights
Tech: AWS, Google Cloud, Azure, Snowflake, Python
Level: Mid