All roles

New Grad Data Engineer ( for Health Tech Start Up)

Remote · USA Full-time New today

❤️ Want to make a tangible difference in the future of healthcare? ❤️ Love applying the latest technologies to solve tough problems? ❤️ Looking for a fulfilling career with plenty of learning and growth? About 1Phi 🤩 1Phi makes healthcare more accessible by lowering the cost and shrinking the time to preventive care and treatment. Through 1Access, we enable digital health apps and devices to accept insurance and integrate into the traditional clinical system. Through 1Sage, our healthcare navigator, we help patients find the treatment options with the best clinical outcomes for their unique situation. 1Phi (1st Principles Healthcare Institute) reflects our commitment to rethinking healthcare from the ground up. As a Public Benefit Corporation, we combine deep expertise in payer models, government policy, provider incentives, and patient advocacy. By unlocking data historically accessible only to industry insiders, 1Phi gives patients and healthcare companies the tools to make better decisions, and brings high-quality care within reach for more people. You would be joining an early stage startup team of engineers, healthcare experts and dreamers. You’ll help define the culture, the tech stack, and the future of our industry. Tight-knit team. Zero bureaucracy. Project ownership. Room to experiment. Accelerated growth. Requirements 🤓 What You’ll Do: Build and maintain data pipelines that ingest, transform, and validate large-scale Medicare claims data using SQL, Python, and Databricks (Spark). You'll work with patient-level records across billions of claim lines. Write and optimize complex SQL — multi-step transformations, window functions, joins across large datasets, aggregations with suppression rules. SQL is the primary language of the work. Automate and operationalize recurring data workflows — building reliable, repeatable pipelines that process CMS data extracts, dimension tables, and derived provider metrics. Ensure data quality by designing validation checks, reconciling source data against expected schemas, and investigating anomalies when numbers don't add up. Collaborate with data scientists and product engineers to define output schemas, deliver clean datasets, and support downstream analytics and application features. Work in cloud infrastructure — primarily Databricks on AWS, with exposure to S3, Unity Catalog, and related services. Learn the healthcare data domain — you'll develop working knowledge of claims data structures, medical coding systems (ICD-10, HCPCS, DRG), and CMS data programs. You're a fit if: You have strong SQL skills. Coursework, internships, or projects where you wrote non-trivial queries — joins, CTEs, window functions, aggregations. You can reason about query performance. You're comfortable with Python. You've used it for data manipulation (pandas, PySpark, or similar). You don't need to be a software engineer, but you can write clean, functional code. You understand data pipeline concepts — ETL/ELT, idempotency, schema management, data validation. Exposure through coursework, capstone projects, or internships counts. You're detail-oriented and methodical. Healthcare data has strict rules around suppression, privacy, and accuracy. You care about getting the numbers right. You're a fast learner who's comfortable ramping up on unfamiliar domains. You'll be learning Medicare claims data, CMS programs, and healthcare coding systems on the job. You have a BS or MS in Computer Science, Data Science, Information Systems, Statistics, or a related field. Even better if: You've worked with Spark, Databricks, or other distributed compute environments (even in a class or personal project). You have exposure to cloud platforms (AWS, GCP, or Azure) — S3, IAM, or managed database services. You've touched healthcare data in any capacity — claims, EHR, public health datasets, MIMIC, CMS public use files. You're familiar with version control (Git) and collaborative development workflows. You've built a data project end-to-end — ingestion through delivery — even if it was small. Salary: $60k-$95k depending on experience Location: Remote United States Only or Hybrid in the Chicago office is available. Hiring Process 😎 At 1Phi, we believe hiring should be fair, transparent, and collaborative. We work hard to find the best mix of respecting the candidates time, while also giving our team an efficient way to learn if a candidate is a good fit. Since we also strive to simulate the work environment throughout the process, we encourage you to use all the AI/LLM tools and outside resources at your disposal. During the process, if you ever feel that you are being asked for an onerous amount of work, please do not proceed. The best candidates are those who enjoy the process, regardless of the outcome. Here’s the entire process: Apply with email → Project demo call (15 mins) → Skills challenge (1-3 hrs) → Interviews (2 hours) → Meet the Team Day and micro-project (1 day) → Offer → Trial period (1-3 weeks) Benefits 🤑 Health insurance within 3 months of starting Generous vacation policy + company holidays 401K + profit share contributions Quarterly evals and performance bonus (~10% at start, ~20% after 4 years) How to Apply 😇 — 🎗️ We only consider applicants who email us. — Send us your resume and any relevant links (LinkedIn, GitHub, portfolio site, etc.) to [email protected] with the subject “Data Engineer”. (Visa considerations: We no longer accept OPT candidates, but can sponsor existing H1B holders.) Apply To This Job

Related roles

Data Engineer (Multiple Levels)

Remote · USA Full-time

[Remote] Data Engineer Associate - Contract

Remote · USA Full-time

Data engineer- DB2

Remote · USA Full-time

Experienced Full Stack Data Engineer – Cloud-Based Data Pipeline Development at careerzynith

Remote · USA Full-time

Senior Data Engineer

Remote · USA Full-time

Senior Data Engineer (Databricks or equivalent)

Remote · USA Full-time

SAP Business Intelligence Developer (Remote Job)

Remote · USA Full-time

Experienced Part-Time Data Entry Specialist – Business Intelligence and Analytics

Remote · USA Full-time

Level III Reporting - Junior Business Intelligence Analyst

Remote · USA Full-time

BI Analyst – Business Intelligence Analyst

Remote · USA Full-time

XPRF Clinical Education Specialist

Remote · USA Full-time

Junior Motion Designer (Performance Marketing)

Remote · USA Full-time

Experienced Part-Time Work From Home Data Entry Operator – Flexible Schedule and Competitive Compensation

Remote · USA Full-time

SAP BTP (Business Technology Platform) Developer

Remote · USA Full-time

Assistant BAS Controls Project Manager

Remote · USA Full-time

Social Worker, LMSW - NY LICENSE REQUIRED

Remote · USA Full-time

Experienced Part-Time Evening Data Entry Specialist – Remote Opportunity with arenaflex

Remote · USA Full-time

Experienced Customer Service and Medical Receptionist – Remote USA Opportunity at arenaflex

Remote · USA Full-time

Field Sales Professional- Midland, Texas

Remote · USA Full-time

Senior Full-Stack Developer - ShareGate Migrate [Web Experience]

Remote · USA Full-time