Senior Data Engineer
Job Summary
The Senior Data Engineer will be an integral part of our data science team at the Regenstrief Center for Healthcare Engineering (RCHE) .This role will architect and develop data engineering solutions to support strategic partnerships, faculty research, operations, education, and outreach needs. This position will be responsible for designing, building, and maintaining data pipelines and tools to assemble and prepare complex datasets for downstream analysis. The position will work independently and within a team environment to gather requirements, review designs, implement and/or integrate new functionality, maintain systems, and assist with quality assurance; create and maintain system documentation to help train operational staff and user guides to train end users and provide some level of direct support to end users; and collaborate with research groups through participation in project meetings and assistance in outreach activities. This position will participate in national computing activities by attending workshops, conferences, presenting projects and contribute to writing conference and journal papers and grant proposals.
Who We Are at Purdue:
The Regenstrief Center for Healthcare Engineering (RCHE) conducts research to improve the quality, accessibility, and affordability of healthcare delivery through collaboration, partnerships, and engagement. RCHE’s team is comprised of researchers, staff, and outreach advisors that collaborate with the healthcare community to develop science-based approaches to personalized care, match health resources with community needs, and improve access to care among rural communities in Indiana and around the world. The RCHE is committed to promoting and advancing all forms of diversity, equity, inclusion, and access (DEIA) to create an environment and culture where the uniqueness of individuals is celebrated, and persons from all backgrounds can thrive.
Duties & Responsibilities
- Design, develop, deploy, and maintain data pipelines and data infrastructure tools.
- Define, design, and implement data governance platform strategies for operational, privacy, data quality, and security components.
- Build data models including robust data definitions, entity-relationship-attribute models, as well as relational and/or dimensional models.
- Design technical strategies and roadmaps for Data Integration, Data Warehousing, Analytics, Reporting, and Data Science.
- Validate the data quality and integration of all Data Architecture Components deemed to be cross-domain or enterprise in scope
- Responsible for collection of necessary Data Architecture metrics to support quality of processes and related artifacts.
- Maintain, propose changes, and verify compliance to Data Architecture Standards and Best Practices.
- Collaborate with research groups to gather requirements, review priorities, and plan development tasks with timelines.
- Collaborate with the data science team to support the successful delivery of data initiatives. Participate in design and code reviews.
- Work independently and collaborate in a team environment
Qualifications
Required:
- Bachelor’s degree in Computer Science, Computer Engineering, or Computer information related field
- Four (4) years of experience in one or more of the following:
- Working as a Data Architect, Database Developer, or in a similar capacity involving data management
- Architecting data-driven solutions involving relational and non-relational data stores, Data Warehousing platforms (OLAP and OLTP) and Data Lake concepts and architecture
- Designing, implementing, and migration cloud-based data platforms such as AWS, Azure
- Equivalent combinations of education and experience may be considered.
- Common programing language such Shell and Python, Java, and Scale
- Knowledge in technologies such as Hadoop, Spark, and other tools from the open-source big data ecosystem
- Knowledge of version control software, i.e. GIT
- Ability to work as part of a high performing team in a collaborative environment
- Ability to plan, organize and prioritize tasks, and complete projects with minimal supervision
- Demonstrated skills in data/pipeline design and development and standard data engineering and access management best practices
- Knowledge of common data engineering software, languages, and packages
- Excellent oral, written, and computer communication skills with strong analytical and troubleshooting skills
Preferred:
- Experience with health care related data such as electronic health records, PACS medical imaging, or wearable devices
- Knowledge of HIPAA rules and data security.
Additional Information:
- To learn more about Purdue’s benefits summary https://bit.ly/3t7vcRd
- A background check will be required for employment in this position
- FLSA: Exempt (Not Eligible For Overtime)
- Retirement Eligibility: Defined Contribution Waiting Period
- Purdue University is an EOE/AA employer. All individuals, including minorities, women, individuals with disabilities, and veterans are encouraged to apply
Nearest Major Market: Lafayette