skills/scientific-thinking/hypothesis-generation

stars:0
forks:0
watches:0
last updated:N/A

Hypothesis Generation

Hypothesis generation is the bridge between observation and experimentation. A good hypothesis transforms vague curiosity into a testable prediction, making explicit what you expect to happen, why, and under what conditions. This skill provides frameworks for converting research gaps, contradictory findings, and cross-domain insights into falsifiable statements that drive experimental design and scientific progress.

When to use

  • Identifying testable predictions from literature gaps or contradictions
  • Converting brainstorming ideas into experimentally tractable questions
  • Formulating hypotheses from clinical or observational patterns
  • Developing grant-specific aims with clear predictions
  • Designing experiments to distinguish between competing theories
  • Applying cross-domain analogies to generate novel predictions
  • Moving from exploratory analysis to confirmatory testing

When NOT to use

  • Early-stage exploration without clear direction (use brainstorming)
  • Evaluating evidence quality of existing hypotheses (use critical thinking)
  • Confirming established knowledge (that's verification, not hypothesis generation)
  • When a phenomenon is already well-explained and no gap exists
  • For purely descriptive research without predictive component

Prerequisites

  • Sufficient domain knowledge to understand the current state of research
  • Familiarity with relevant literature and open questions
  • Basic understanding of experimental design principles
  • Access to data or observations to build upon
  • Awareness of competing theories or explanations in the field

Core workflow

1. Identify the hypothesis source

Determine where the hypothesis idea originates:

Knowledge gaps in literature

  • What questions remain unanswered?
  • What has been overlooked or insufficiently studied?
  • What contradictions exist between studies?

Contradictory findings

  • Studies that report opposing results
  • Inconsistent effect sizes across populations
  • Conflicting mechanistic explanations

Cross-domain analogies

  • Insights from other fields that might apply
  • Biological parallels to engineering solutions
  • Methodological advances from adjacent areas

Methodological advances

  • New techniques that enable previously impossible tests
  • More precise measurements or manipulations
  • Scale or resolution improvements

Clinical or observational patterns

  • Unexpected observations in practice
  • Patient/subject patterns noticed but unexplained
  • Natural experiments or quasi-experiments

Theoretical predictions

  • Predictions from models that need testing
  • Implications of established theories
  • Boundary conditions not yet explored

2. Specify the hypothesis anatomy

A well-formed hypothesis has five components:

1. Variables

  • Independent variable (what you manipulate or vary)
  • Dependent variable (what you measure)
  • Control variables (what you hold constant)

2. Directional prediction

  • What you expect to happen
  • Specific direction (increase/decrease, positive/negative)
  • Magnitude or effect size expectations

3. Mechanism (if causal)

  • Why you expect this to happen
  • Underlying causal chain
  • Theoretical basis for the prediction

4. Scope/conditions

  • When the hypothesis applies
  • Boundary conditions
  • Populations, settings, or contexts

5. Operationalization

  • How each variable will be measured
  • Cutoffs or thresholds for categorization
  • Specific experimental protocols

Template:"

"We hypothesize that [IV] will [directionally affect] [DV] in [population/samples], because [mechanism], and this will be measured by [operationalization]."

3. Apply the falsifiability test

Following Popper's criterion, a hypothesis must be falsifiable:

Questions to ask:

  • Could the hypothesis be proven wrong by observation?
  • Is there a conceivable result that would contradict it?
  • Are the variables operacionais in a way that allows rejection?

Types of unfalsifiable hypotheses to avoid:

  • No clear prediction (too vague)
  • Invulnerable to disconfirmation (always interprets结果是"支持")
  • Tautological (true by definition)
  • Untestable with available methods

Making hypotheses falsifiable:

  • Specify exact predictions, not just direction
  • State magnitude expectations
  • Define rejection criteria in advance
  • Identify alternative explanations

4. Design a test

How would you test this hypothesis?

Requirements:

  • An experiment or observation that could yield conflicting results
  • A way to measure the dependent variable
  • Appropriate controls
  • Sufficient sample/power

Design considerations:

  • What design can differentiate this from alternatives?
  • What would confirm? What would falsify?
  • What are plausible confounds?
  • How will you handle ambiguity in results?

5. Specify alternatives and boundary conditions

A strong hypothesis acknowledges:

Competing hypotheses

  • What else could explain the results?
  • What are the leading alternatives?
  • How does this hypothesis differ?

Boundary conditions

  • When would you expect this not to hold?
  • What are the limits of the prediction?
  • What contextual factors matter?

Effect size expectations

  • What magnitude of effect makes the hypothesis "true"?
  • What magnitude suggests falsification?
  • Is the effect clinically/practically significant?

Code patterns

Hypothesis specification template

HYPOTHESIS: [Number]

Variables:
- Independent variable: [precise definition]
- Dependent variable: [precise definition]
- Control variables: [list]

Prediction:
- We expect [IV] to [increase/decrease] [DV] by approximately [magnitude/percentage].

Mechanism:
- [IV] affects [DV] through [mechanism], based on [theory/prior work].

Scope:
- This prediction applies to [population/context].
- We expect this in [conditions], but not when [conditions].

Operationalization:
- [IV] will be operationalized as: [specific measurement]
- [DV] will be operationalized as: [specific measurement]

Rejection criteria:
- If [observed result], we reject the hypothesis.

Alternative explanations:
1. [Competing hypothesis 1]
2. [Competing hypothesis 2]

Example: From gap to hypothesis

Source: Two studies on mitochondrial function in aging report conflicting results - one shows decline, one shows no change.

Gap: Why the inconsistency? One uses tissue homogenates, one uses single cells.

Hypothesis Generation:

HYPOTHESIS 1: Tissue-level measurements obscure cell-type heterogeneity
in mitochondrial function decline during aging.

Variables:
- Independent variable: Measurement scale (tissue homogenate vs. single cell)
- Dependent variable: Mitochondrial function (ATP production rate)
- Control variables: Age, species, tissue type

Prediction: Single-cell measurements will reveal significant
age-related decline that tissue homogenates obscure by averaging
across cell subpopulations with different function trajectories.

Mechanism: Cell-to-cell variation increases with age; averaging
masks decline in high-function cells by diluting with low-function cells.

Scope: Applicable to post-mitotic tissues (neurons, muscle).
Not applicable to proliferating tissues with ongoing stem cell input.

Rejection criteria: If single-cell measurements show no greater
age-related decline than tissue homogenates, reject hypothesis.

Contradictory findings resolution framework

CONFLICT RESOLUTION: [Citation A] vs. [Citation B]

Both studies claim:
- Study A: [Claim]
- Study B: [Opposing claim]

Potential explanations:
1. Population differences (species, age, sex)
2. Methodological differences (assay, timing)
3. Context differences (in vivo vs. in vitro)
4. Statistical artifacts (power, analysis)

HYPOTHESIS to resolve:
We hypothesize that [variable X] explains the discrepancy,
specifically [prediction].

From clinical observation to hypothesis

CLINICAL OBSERVATION:
[Describe unexpected pattern noticed in practice]

PATTERN TO EXPLAIN:
[What was observed]

LEADING HYPOTHESES:
1. [Potential explanation 1]
2. [Potential explanation 2]

HYPOTHESIS TO TEST:
We hypothesize that [specific mechanism], based on
[observations from basic science / analogous conditions].

Testable prediction:
[What would we expect to see if this is correct?]

Common pitfalls

  • Vague predictions: "X affects Y" is not a hypothesis; "X increases Y by 20-30%" is.
  • Unfalsifiable wording: Avoid "may," "could," or "might" when making predictions.
  • Mechanical application: Don't force every observation into hypothesis form when it's better described as an exploratory finding.
  • Missing mechanism: Without a plausible mechanism, predictions are arbitrary.
  • Ignoring alternatives: Strong hypotheses specify what's being distinguished from.
  • Unrealistic scope: Start with narrowly testable hypotheses before combining.
  • Shoehorning: Don't try to find evidence FOR your hypothesis; design tests that could falsify it.
  • Neglecting boundary conditions: Specify when your hypothesis should NOT hold.

Validation

How to know your hypothesis generation was successful:

  • The hypothesis is stated as a specific, testable prediction
  • Variables are clearly defined and operacionalized
  • The hypothesis could be falsified by a plausible outcome
  • There's a clear experimental design to test it
  • Alternative explanations are acknowledged
  • The scope and boundary conditions are specified
  • The mechanism or theoretical basis is provided
  • Effect size expectations are stated
  • The hypothesis addresses a genuine gap or contradiction

References

  • Related ors- skills:*

    • ors-scientific-thinking-brainstorming (for initial exploration)
    • ors-scientific-thinking-critical-thinking (for evaluation)
    • ors-scientific-thinking-perspective-tour (for multi-perspective framing)
    • ors-scientific-thinking-failure-handling (if results are negative)
    • ors-research-grants-specific-aims (for hypothesis in grant context)
  • External resources:

    • Popper, K.R.. The Logic of Scientific Discovery
    • Lakatos, I.. Falsification and the Methodology of Scientific Research Programmes
    • CONSORT Statement - hypothesis reporting in trials
    • STROBE Statement - hypothesis reporting in observational studies

Changelog

  • 1.0.0 (2026-06-10): Initial adaptation by Pradyumna Jayaram, integrating Popper's falsifiability criterion, Lakatos's research programmes, and structured frameworks for converting knowledge gaps and contradictions into testable hypotheses.
    Good AI Tools