APDG: Advanced Patient Data Generator
 

APDG is a knowledge-based approach to synthesizing large-scale patient data. The basic rationale is to make the generated patient data look as realistic as possible by using domain knowledge to control the generation process. This domain knowledge can be collected from biomedical publications (for example, those indexed in PubMed), from medical textbooks, or from web resources (e.g. Wikipedia and medical websites). The collected knowledge is formalized in the Patient Data Definition Language (PDDL), which then drives the patient data generation.
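The idea of knowledge-controlled generation can be illustrated with a minimal Python sketch. It is not APDG code and does not use PDDL syntax: the `KNOWLEDGE` dictionary, the attribute names, the function names, and all weights are hypothetical placeholders chosen for illustration, not real clinical statistics. The sketch assumes that each piece of domain knowledge can be reduced to a weighted distribution over attribute values.

```python
import random

# Hypothetical knowledge base: each attribute maps to weighted values.
# The weights are illustrative placeholders, NOT real clinical statistics,
# and this dictionary is NOT PDDL; it only stands in for formalized knowledge.
KNOWLEDGE = {
    "gender": {"female": 0.99, "male": 0.01},
    "stage": {"I": 0.4, "II": 0.3, "III": 0.2, "IV": 0.1},
    "er_status": {"positive": 0.7, "negative": 0.3},
}


def generate_patient(knowledge, rng):
    """Sample one synthetic patient record from the weighted distributions."""
    patient = {}
    for attribute, distribution in knowledge.items():
        values = list(distribution.keys())
        weights = list(distribution.values())
        # random.choices draws according to the given relative weights.
        patient[attribute] = rng.choices(values, weights=weights, k=1)[0]
    return patient


def generate_patients(knowledge, count, seed=0):
    """Generate `count` records; a fixed seed keeps the output reproducible."""
    rng = random.Random(seed)
    return [generate_patient(knowledge, rng) for _ in range(count)]


patients = generate_patients(KNOWLEDGE, 1000)
```

Each record in `patients` is a plain dictionary with one value per attribute. A more realistic generator would also need dependencies between attributes (for example, a treatment conditioned on disease stage), which this sketch deliberately leaves out.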

We have used APDG to generate large-scale data for breast cancer patients in experiments with SemanticCT, a semantically enabled system for clinical trials.