About the role:
As Director of Data Operations, you will lead most of the company's efforts in collecting and curating multimodal data. You will lead the team whose role spans from extracting raw data from our data partners' information systems to providing a fit-for-AI dataset that data scientists can then use. This involves many steps, such as extraction, cleaning, curation, imputation, pseudonymisation, joining of different datasets (e.g. histology slides with patient tabular information), alignment to a common data model and design of connectors to these data.
Owkin has a particular focus on multimodal datasets, including histology and genomics data on top of clinical data. Our projects span various therapeutic areas and fields of medical expertise (oncology, cardiovascular, immunity and inflammation). The team is in charge of managing the data curation process across all modalities, with particular expertise in clinical data.
Day to day, you will interact with our data partners, often large hospitals, to extract and prepare specific datasets for later use in data-science-driven biomedical research. You will drive the interactions with all stakeholders and data operations to produce a final, clean, fit-for-AI dataset that matches the project requirements.
Additionally, you will manage the process for generating rich datasets using the latest technologies available in genomics and single cell omics. This will involve participating in the definition of the acquisition pipeline and, most importantly, organizing all the relevant data management activities.
You will also initiate the development of innovative approaches to curate data through automated tools, in order to optimize and scale the process of getting data on Owkin’s platform for biomedical research.
In particular, you will:
- Be responsible for the delivery of high-quality multimodal datasets matching the expectations of our internal clients, i.e. the data science and biomedical teams.
- Lead and grow a team of data stewards and data engineers to deliver all operations related to multimodal data.
- Lead data operations transformation and strategy development. Define the data operations strategy, and propose and implement a roadmap. Propose a strategy to scale the activity.
- Drive excellence in delivering all operations related to data quality. Audit current ways of working and propose external partnerships to optimize our processes. Propose standard documentation and processes.
- Lead interactions with data providers (e.g. hospitals) and develop excellent relationships with key stakeholders (EU and NA).
- Lead the budget (design, approval and monitoring) of the data operations team.
- Manage relationships with external vendors and service providers complementing internal resources.
- Develop strong partnerships with CROs and other service providers. Contribute to selection and supervision as well as organization and monitoring of external partners.
The position is based in the Paris, Nantes or London office, or fully remote, under the responsibility of the SVP Partnerships.
The responsibilities described are not an exhaustive list; additional tasks may be assigned or the scope of the job may change as necessitated by business demands.
About you
Required qualifications / experience:
- BS/MSc/PhD in Health Informatics, biomedical research, clinical studies or an associated field
- Proven experience in healthcare data management, including experience running clinical studies involving the collection of multimodal datasets used for data science and/or delivering research-ready datasets.
- Excellent understanding of technical aspects related to data management, including pseudonymisation methodologies
- Experience in building teams and developing talent, including technical expertise
- Familiarity with how large hospitals are organized and understanding of project management in this context.
- Experience working with private and sensitive personal information, including ethical and regulatory constraints
- Excellent task management, with a sense of urgency and the ability to manage business priorities.
Bonus:
- Knowledge of healthcare information systems and the main software in use
- Data engineering skills
- Knowledge of histology, genomics and/or single cell genomics is a significant plus
- Professional experience in a multicultural environment, and ideally one successful experience working on data operations projects outside Europe