my jobs login
Welcome back
Register
Genetic disease diagnosis (Debate)
Home > Analysis Platform > Genetic disease diagnosis (Debate)
Introduction:

The method (MD2GPS) was designed to assist in the joint auxiliary diagnosis of genetic diseases based on the patient's genotype and phenotype.

Developer


This pipeline is developed by BMAP team.

Workflow


Data Agent

1.1 Variant Filtering Module: Annotates genetic variants in VCF files using ANNOVAR and snpEff, and applies multi-level filtering based on HGMD, ClinVar, MutationTaster2021, SIFT, and PolyPhen-2. Genes unrelated to the phenotype are further filtered using LLM.
1.2 Variant Prioritization Module: Ranks candidate mutations using a negative binomial test-based prioritization algorithm.
1.3 Natural Language Explanation Module: Utilizes LLM to generate explanations of the ranking results, including pathogenic mechanisms, gene functions, and phenotype associations, facilitating downstream debate and clinical interpretation.

Knowledge Agent

2.1 Performs symptom-based reasoning on the top 20 candidate pathogenic genes from the result of Data Agent using LLM, integrating gene function, phenotype relevance, and gene-gene interactions, and outputs ranking results in JSON format.

Debate Agent

3.1 Employs a debate framework grounded in Bayesian reasoning, where the Knowledge and Data Agents debate over a deduplicated and randomized list of candidate pathogenic genes.
3.2 Coordinates iterative rounds of reasoning between agents based on patient-specific information and supporting evidence from the Data Agent. Within a maximum of N rounds (configurable parameters), it guides convergence of opinions and synthesizes a final interpretable ranking report incorporating both annotations and knowledge-based insights.

workflow diagram

workflow.png

Address: 800 Dong Chuan RD. Minhang District, Shanghai, China SJTU-Yale Joint Center for Biostatistics, SJTU   

Copyright © 2021 沪交ICP备20190249. All Rights Reserved