Natural history studies play a pivotal role in understanding how diseases develop and progress over time. These observational studies track disease trajectories without experimental intervention and are particularly important in rare disease research, where existing data is often limited. They serve as fundamental benchmarks against which the effectiveness of new treatments can be measured. Over the course of a study, extensive data is collected, including initial diagnosis, clinical observations, treatment history, and patient-reported outcomes on quality of life. This longitudinal data provides the foundation for understanding disease progression, identifying meaningful endpoints, and informing the design of future clinical trials.
The long-term nature of these studies generates large, complex datasets that require structured analysis. Artificial intelligence (AI) is increasingly being applied to support this process, from pattern recognition across patient populations to predictive modeling of disease trajectories. This article defines natural history studies, outlines their role in drug development, and examines specific ways AI is being used to improve how these studies are designed, conducted, and analyzed.
A natural history study is a preplanned observational study designed to track the course of a disease over time. Its purpose is to identify demographic, genetic, environmental, and other variables that correlate with the disease's development and outcomes. Traditionally, the "natural history" of a disease refers to its progression in the absence of intervention, from onset until resolution or end of life. In practice, most natural history studies include patients receiving the current standard of care or emergent treatment, which may alter some manifestations of the disease. Fully untreated populations are rare, both for ethical reasons and because most patients will have received some form of care prior to enrollment.
Natural history studies aim to capture various factors, including demographic, genetic, and environmental variables, that correlate with disease development and outcomes. It is worth distinguishing these studies from patient registries, which are organized systems for collecting and storing information about people with a disease. While registries tend to be broad in scope, natural history studies have a more specific goal: tracking how a disease progresses over a defined period. Some registries do function as natural history studies, depending on the depth and structure of the data they collect, but the two are not interchangeable.
Natural history studies are vital for a few key reasons:
Study design considerations: Natural history studies can be retrospective, prospective, or both. Retrospective studies draw on pre-existing data, such as medical records, to reconstruct a patient's disease trajectory. Prospective studies collect new data from patients over a defined period. Many studies combine both approaches, using historical records to establish baseline context while collecting new observations going forward. The choice of design has direct implications for data quality, regulatory utility, and the operational demands placed on sites and participants.
Regulatory authorities are recognizing the value of real-world evidence, and natural history studies are gaining prominence in drug development. The U.S. Food and Drug Administration (FDA) has issued draft guidance specifically addressing the use of natural history study data in rare disease drug development. The guidance notes that natural history information is usually not available or is incomplete for most rare diseases, making these studies particularly important. It emphasizes four key areas:
Natural history studies generate heterogeneous data across multiple sites, time points, and collection methods. Managing this complexity manually introduces delays, inconsistencies, and gaps in analysis. AI is being applied across several areas to address these challenges:
Across these areas, AI introduces capabilities that complement traditional research methods: faster pattern recognition, more consistent data handling, and the ability to work across larger and more diverse datasets. Its role in natural history studies is still evolving, but the practical applications are already visible in how studies are designed, monitored, and analyzed.
Natural history studies remain one of the most important tools for understanding disease progression, particularly in rare diseases where baseline data is limited. As the FDA has emphasized, well-designed natural history studies directly support the development of safe and effective therapies by establishing the evidence base needed for regulatory submissions.
AI is beginning to reshape how these studies are conducted, from improving data collection and standardization to enabling more sophisticated analysis across geographically dispersed populations. The practical impact is not speculative. It is visible in how study teams manage complexity, identify patterns, and generate actionable insights from longitudinal datasets.
For sponsors running precision medicine programs, the challenge is not simply collecting more data. It is building the infrastructure to collect, integrate, and act on that data across the full patient journey. Natural history studies are often the first step in that process, and how they are designed determines what is possible downstream.
To explore how Sano supports sponsors in designing and executing natural history studies, from patient finding and engagement to long-term data collection and recontact, get in touch.