Health Data Host (HDS) certification
French authority standards compliance
Find patients for your investigation purposes is just a few clicks away:
- Filter by demographic criteria (date of birth, gender, living/deceased)
- Identify the right clinical profiles (diagnoses, prescriptions and procedures) using standard nomenclatures (ICD10, CCAM)
- Capitalize on all of the information stored in the databases, as well as in the documents themselves, thanks to our NLP algorithms.
This allows you to estimate the size of your cohorts and conduct your feasibility studies.
Explore allows you to create patient subgroups for research and monitoring purposes: clinical studies, evaluation of medical practices, care improvement, etc.
Explore helps you leverage your data with aggregated statistics on age, gender and status of your cohort's patients.
Use Arkhn Explore in your facility by :
- connecting the solution directly to your healthcare data warehouse if your hospital is already equipped with one
- integrating all data sources into our datalake to feed Arkhn Explore
Take advantage of Arkhn's expertise in data integration and quality control for cohort creation.
Use our solution directly on an HDS-certified French cloud for rapid deployment, or opt for an on-premises installation in your facility.
Would you like to take your data even further, and use it for purposes other than medical research and management?
Arkhn offers you a complete data architecture which enables you to integrate additional data sources and feed all your new projects.
Thanks to our data architecture, you'll have a datalake you can control, with all your data centralized, quality-controlled and standardized.
We are ISO 27001 and Health Data Host (HDS) certified and all our solutions are developed in compliance with CNIL standards. Your data is hosted by our HDS-certified French sovereign cloud provider, and can be hosted on-premises on request.
Our team has developed mastery in AI & NLP applied to healthcare, which is used in our solutions that help you to identify and structure all the medical data available in your patients' documents with rigorous quality control. We mobilize state-of-the-art technologies such as large language models to bring you the best algorithms.
Our aim is to provide healthcare facilities with reliable, usable healthcare data. This is made possible thanks to the essential step of data quality enhancement, carried out with the help of our data architecture solution.
We are committed to guaranteeing the syntactic and semantic interoperability of data. Arkhn brings together a team of specialists in healthcare standards - FHIR, v2, OMOP (EDHEN certification), ... - and various terminologies - SNOMED CT, LOINC, CIM, UMLS...
A tool enabling physicians to:
- secure and autonomous access to facility data
- facilitate and accelerate research projects by pre-screening patients
- reduce their dependence on technical teams
The Arkhn data architecture embodies the entire technical stack for deploying a facility's health data warehouse, from the methods for integrating data quality and documentation to the tools for accessing it.
Arkhn Explore is based on a minimal version of the Arkhn data architecture.
Arkhn adapts to the maturity level of your organization to create a comprehensive health data management framework:
- If the facility has its own health data warehouse, we can build on this to deploy Arkhn Explore.
- If the facility does not have its own health data warehouse: We deploy a minimal version of our data architecture to implement Arkhn Explore. It is also possible to deploy the full version of the Arkhn data architecture and access the Arkhn health data warehouse.
Our teams of data experts frame the project with the facility's staff and discuss it with the medical teams. The aim is to adapt to the level of maturity of each facility and to identify specific needs (use cases). We work with the professionals who master the data pathway in the centers (MID, ISN, Medical Team) to understand the patient and data pathways and identify their relevant sources.
We configure and integrate data into our tools. We verify data integration, carrying out a range of technical tests for data quality and the functional objectives defined in the use case (e.g. can Explore automatically find patients selected by hand in a preliminary study of the center?).
From a regulatory point of view, Explore's pre-screening view only contains aggregated data, and can therefore be accessed by all medical and research teams. Cohort extraction, on the other hand, is subject to approval by administrators (ISD, MID or other, depending on the hospital).
Pseudonymization of data is the mandatory step for research projects (with the exception of doctors who work with data from patients they see in their departments). Once the doctor has completed the pre-screening of the data, in other words, his feasibility study (which contains only aggregated data), he may then want to extract the identified patient data to carry out his study: this extraction includes a pseudonymization phase thanks to our NLP (Natural Language Processing) algorithms.Our pseudonymization algorithm uses state-of-the-art technologies (based on Transformer technology and BERT models), and is 99% accurate in identifying and masking a range of sensitive information (surname, first name, date of birth, PPI, address, email, telephone, etc.). It has been trained on a large database of reports manually annotated by medical experts.
A first category of NLP models are used to identify medical entities (procedures, diagnoses and prescriptions) within medical documents, and to standardize all available medical entities (structured or unstructured from documents) into standard ontologies (ICD10, etc.).
A second category of models is dedicated to the pseudonymization of medical documents, notably for clinical research projects.
NLP models systematically feed Explore's functionalities.
Our algorithms use state-of-the-art technologies (based on Transformer technology and BERT models). They identify clinical entities (such as drugs) and standardize them into a reference terminology (such as CCAM). Our performance is 97.5% on the identification (recall) of treatments, examinations, diagnoses, symptoms and drugs.
- Number of patients meeting criteria, with reminder of selected criteria.
- Various views of aggregated statistical data on age, sex and living/deceased status (min, max, mean, standard deviation, histogram, etc.).