Big Data Engineer (CAFE)



Health Catalyst



Salt Lake City, UT, US


About Health Catalyst

Health Catalyst was named one of the 30 Best Workplaces in Technology by Fortune Magazine and the 11th best place to work by Glassdoor. Health Catalyst earned the highest overall score in Healthcare BI from KLAS and was named to the World’s Best 100 cloud companies by Forbes. Health Catalyst analyzes healthcare records of almost a third of the US population (65 million patients) and recently released the first open source software for healthcare machine learning.

Health Catalyst’s platform and applications are used at leading health systems including John Muir Health, Kaiser Permanente, MultiCare Health System, Partners HealthCare, Providence Health & Services, Stanford Hospital & Clinics, Texas Children’s Hospital, and over 40 others. Health Catalyst products and services are used in over 400 hospitals and 4,000 clinics, supporting over 90 million patients.

Our team lives the cultural attributes of Smart, Hardworking, and Humble.

Job Summary

Health Catalyst is aggregating the largest detailed healthcare dataset ever assembled; come help us make it happen! This position is a unique career opportunity to make an enormous, unprecedented impact on US healthcare. The Health Catalyst Collective Analytics for Excellence (CAFÉ) product line will integrate data from our 40+ clients, encompassing 65+ million patients and their data. CAFÉ enables (i) the comparison of outcome and detailed process metrics between organizations and national benchmarks and (ii) machine learning on multi-customer datasets to allow Health Catalyst to identify insights into healthcare treatment and process best practices.

Duties & Responsibilities

Help design, implement, and manage all aspects of the data pipeline that will aggregate data from customer sites and collect it in a central data lake for benchmarking and machine learning algorithms
Implement the data storage process and analytics technologies needed to merge data from multiple customers and eventually draw insights from the whole
Total data size will grow to the petabyte scale, so a focus on scalability will be critical
Design and construct data products utilizing the central repository of customer data
Provide benchmarking metrics to customers through web-based dashboards and current Health Catalyst BI tools

Required Skills

Fluent in at least one of Java, Python, C#, or C++
Solid understanding of data technologies like SQL and NoSQL
Experience with data pipelines
Ability to deal with highly sensitive and confidential information and adhere to data security and confidentiality protocols and procedures
Self-motivated; comfortable working independently under general direction

Desired Skills

Experience with ‘big data’ analytic technologies (e.g. Hadoop, Spark, Elasticsearch, Druid)
Experience with cloud-based infrastructure (e.g. AWS, Microsoft Azure, Google Cloud Platform)
Experience with visualization technologies (D3 and HTML5)
Experience with microservices and ESB technologies
Healthcare data and analytics experience
Familiarity with development methodologies, including Agile approaches
Familiarity with AI/machine learning techniques
Familiarity with front-end frameworks (e.g. Angular, AngularJS, React)

Education & Relevant Experience

5+ years’ experience in a technology or technology-related field
BS/BA in computer science, information systems, or another technology or science field

The above statements describe the general nature and level of work being performed in this job function. They are not intended to be an exhaustive list of all duties, and indeed additional responsibilities may be assigned by Health Catalyst.
