2 Overview
2.1 Course philosophy
NREC6006 is not a traditional international trade theory course. It is an applied research course on gravity models in Python. The final objective is a reproducible empirical project that can develop into a publication-style gravity paper.
The course begins with replication. Students replicate a Post-Soviet gravity paper before designing their own regional extension. This sequence teaches how empirical trade research actually works: the paper, data, code, tables, and interpretation must be consistent.
2.2 Why replication comes first
Replication is the starting point because it forces students to answer practical research questions:
- What is the unit of observation?
- Which variables are required?
- Which sample restrictions matter?
- Which estimators are used?
- Which coefficients are robust?
- Which discrepancies require explanation?
The reference dataset contains 5,253 observations, 15 exporters, 15 importers, and years 1992-2020. The core institutional variables are \(wto\_joint\), \(EU\_joint\), and \(EAEU\_joint\).
2.3 Why Python is used
Python is used because it supports a complete applied workflow in one environment:
- data inspection and cleaning with
pandas; - numerical transformations with
numpy; - OLS, fixed-effects-style formulas, PPML, and GPML with
statsmodels; - descriptive plots with
matplotlib; - network analysis with
networkx; - reproducible notebooks that combine code, tables, and interpretation.
The course also helps students read gravity examples from other software traditions and translate them into Python-based workflows.
2.4 What students produce
By the end of the course, students produce:
- a documented gravity dataset;
- a reproducible Python notebook;
- replication tables for the Post-Soviet paper;
- robustness and sensitivity notes;
- a presentation;
- a publication-style gravity paper.
The paper is not written after the econometrics are finished. It is built throughout the course.
2.5 Replication workflow
The course follows a research workflow:
research question -> data -> descriptive analysis -> OLS -> FE -> PPML -> robustness -> paper
In practice, the workflow is iterative. Descriptive analysis may reveal a sample issue. Fixed effects may change the institutional interpretation. PPML may raise questions about zero trade flows. Robustness checks may require returning to the data.
2.6 Model sequence
Students learn the estimators in the order needed for the replication:
| Stage | Estimator | Purpose |
|---|---|---|
| Baseline | OLS | Teach the log-linear gravity model |
| Heterogeneity | FE OLS | Control for exporter and importer differences |
| Transformation | DDM | Show the mechanics behind two-way fixed effects |
| Normalization | BVU | Study trade intensity relative to economic size |
| Multiplicative model | PPML | Estimate gravity in levels |
| Robustness | GPML | Compare alternative multiplicative assumptions |
| Structural gravity | Structural PPML | Add exporter-year and importer-year fixed effects |
The goal is not to memorize estimators. The goal is to understand what each estimator controls for and how that changes interpretation.
2.7 Output standard
Every student output should be reproducible. A table in the paper must be traceable to a dataset, notebook cell, model formula, and interpretation note.
The course standard is not “the coefficient looks right.” The standard is: the data, code, estimate, and interpretation can be checked by another researcher.