Appendix C — Formula Sheet

C.1 Purpose

This appendix collects the formulas used in the Post-Soviet replication and in the final region-specific gravity paper. Use it to keep notation consistent across notebooks, tables, and the methodology section.

C.2 Notation

Symbol Meaning
\(i\) Exporter
\(j\) Importer
\(t\) Year
\(X_{ijt}\) Bilateral exports or trade flow
\(Y_{it}\) Exporter economic size
\(Y_{jt}\) Importer economic size
\(D_{ij}\) Bilateral distance or trade-cost proxy
\(Z_{ijt}\) Additional bilateral and institutional controls
\(\alpha_i\), \(\delta_j\), \(\lambda_t\) Exporter, importer, and year fixed effects
\(\mu_{ij}\) Country-pair fixed effect
\(\alpha_{it}\), \(\delta_{jt}\) Exporter-year and importer-year fixed effects

In the Post-Soviet project, \(X_{ijt}\) is measured by \(flow\), economic size by \(gdp\_o\) and \(gdp\_d\), distance by \(distw\), and institutional variables by \(wto\_joint\), \(EU\_joint\), and \(EAEU\_joint\).

C.3 Basic Gravity

The intuitive gravity equation is:

\[ X_{ij} = G Y_i^{\beta_1} Y_j^{\beta_2} D_{ij}^{\beta_3} \]

If distance is expected to reduce trade, \(\beta_3\) should be negative. Some textbooks write distance in the denominator; both forms express the same intuition when the distance coefficient is signed correctly.

C.4 Log-Linear Gravity

The baseline empirical model is:

\[ \begin{aligned} \log X_{ijt} &= \beta_0 + \beta_1 \log Y_{it} + \beta_2 \log Y_{jt} \\ &\quad + \beta_3 \log D_{ij} + \gamma Z_{ijt} + \varepsilon_{ijt} \end{aligned} \]

This model is easy to estimate with OLS and easy to interpret. It should be treated as a teaching baseline, not the final word, because it excludes zero trade flows and can be biased under heteroskedasticity.

C.5 Post-Soviet OLS Specification

The course replication begins with:

\[ \begin{aligned} \log(flow_{ijt}) &= \beta_0 + \beta_1 \log(gdp\_o_{it}) + \beta_2 \log(gdp\_d_{jt}) \\ &\quad + \beta_3 \log(distw_{ij}) + \gamma_1 comlang\_off_{ij} + \gamma_2 contig_{ij} \\ &\quad + \gamma_3 wto\_joint_{ijt} + \gamma_4 EU\_joint_{ijt} + \gamma_5 EAEU\_joint_{ijt} + \varepsilon_{ijt} \end{aligned} \]

Logged GDP coefficients are elasticity-like. Dummy coefficients should be converted to percent effects using the formula below.

C.6 Structural Gravity

A compact structural gravity expression is:

\[ \begin{aligned} X_{ij} &= \frac{Y_i E_j}{Y} \left( \frac{\tau_{ij}}{\Pi_i P_j} \right)^{1-\sigma} \end{aligned} \]

Here, \(\tau_{ij}\) is bilateral trade cost, \(\Pi_i\) is outward multilateral resistance, \(P_j\) is inward multilateral resistance, and \(\sigma\) is the elasticity of substitution. The key implication is practical: bilateral trade depends on relative trade costs, not only the direct cost between two countries.

C.7 Fixed-Effects Gravity

A standard fixed-effects version is:

\[ \begin{aligned} \log X_{ijt} &= \beta_1 \log D_{ij} + \gamma Z_{ijt} + \alpha_i + \delta_j + \lambda_t + \varepsilon_{ijt} \end{aligned} \]

Exporter, importer, and year fixed effects absorb broad country and time differences. They make coefficient interpretation more conditional.

C.8 Pair Fixed Effects

Pair fixed effects compare a country pair with itself over time:

\[ \begin{aligned} \log X_{ijt} &= \gamma_1 wto\_joint_{ijt} + \gamma_2 EU\_joint_{ijt} + \gamma_3 EAEU\_joint_{ijt} \\ &\quad + \mu_{ij} + \lambda_t + \varepsilon_{ijt} \end{aligned} \]

Time-invariant variables such as \(distw\), \(comlang\_off\), and \(contig\) are absorbed by \(\mu_{ij}\) and cannot be estimated separately.

C.9 Structural Fixed Effects

A saturated structural gravity specification often uses exporter-year and importer-year fixed effects:

\[ \begin{aligned} E[X_{ijt} \mid Z_{ijt}] &= \exp\left( \gamma Z_{ijt} + \mu_{ij} + \alpha_{it} + \delta_{jt} \right) \end{aligned} \]

This structure absorbs exporter-year supply conditions, importer-year demand conditions, and time-varying multilateral resistance. GDP variables are usually absorbed.

C.10 Double Demeaning

For a variable \(x_{ij}\) with exporter and importer groups:

\[ \tilde{x}_{ij} = x_{ij} - \bar{x}_{i\cdot} - \bar{x}_{\cdot j} + \bar{x} \]

The transformed variable removes exporter and importer means. Double demeaning is useful for teaching fixed-effects logic because it shows how group averages are removed before estimation.

C.11 BVU

Bonus Vetus OLS uses normalized gravity logic to approximate structural trade-cost effects. A simple teaching version uses trade intensity:

\[ \begin{aligned} \log TI_{ijt} &= \log(flow_{ijt}) - \log(gdp\_o_{it}) - \log(gdp\_d_{jt}) \end{aligned} \]

Then estimate:

\[ \begin{aligned} \log TI_{ijt} &= \beta_0 + \beta_1 \log(distw_{ij}) + \gamma Z_{ijt} + \varepsilon_{ijt} \end{aligned} \]

This is a teaching approximation. Publication work should verify the exact Bonus Vetus transformation against the intended reference implementation.

C.12 BVW

BVW extends the normalization logic by using GDP-weighted averages to approximate multilateral resistance:

\[ \begin{aligned} x^{BVW}_{ijt} &= x_{ijt} - \sum_j s_{jt} x_{ijt} - \sum_i s_{it} x_{ijt} \\ &\quad + \sum_i \sum_j s_{it}s_{jt} x_{ijt} \end{aligned} \]

Here, \(s_{it}\) and \(s_{jt}\) are GDP shares. The exact implementation is more involved than this compact expression, so students should treat classroom BVW code as a transparent learning tool.

C.13 PPML

PPML estimates trade in levels:

\[ \begin{aligned} E[X_{ijt} \mid Z_{ijt}] &= \exp\left( \beta_0 + \beta_1 \log D_{ij} + \gamma Z_{ijt} \right) \end{aligned} \]

PPML can include valid zero trade flows and is robust to important heteroskedasticity problems in log-linear gravity models. It is estimated with a Poisson conditional mean but does not require trade to be count data.

C.14 GPML

GPML uses a Gamma conditional mean with a log link:

\[ \begin{aligned} E[X_{ijt} \mid Z_{ijt}] &= \exp(Z_{ijt}\beta), \qquad X_{ijt} > 0 \end{aligned} \]

GPML is a positive-flow estimator. It is useful as a robustness check, not as a replacement for explaining zero-flow treatment.

C.15 NBPML

NBPML keeps the multiplicative conditional mean but allows a negative-binomial variance structure:

\[ \begin{aligned} E[X_{ijt} \mid Z_{ijt}] &= \exp(Z_{ijt}\beta) \end{aligned} \]

Use NBPML as a robustness estimator when overdispersion is substantively important and convergence is reliable.

C.16 Dummy Interpretation

For a dummy coefficient \(\hat{\gamma}\) in a log-link model:

\[ \% \Delta X = 100 \times \left[ \exp(\hat{\gamma}) - 1 \right] \]

Example interpretation template: joint membership in [institution] is associated with a conditional trade difference of \(100 \times [\exp(\hat{\gamma}) - 1]\) percent, holding the model controls fixed.

C.17 Elasticity Interpretation

When both dependent and independent variables are logged:

\[ \frac{\partial \log X}{\partial \log D} = \beta_D \]

A 1 percent increase in distance is associated with an approximate \(\beta_D\) percent change in trade. Because distance coefficients are usually negative, larger distance is expected to reduce trade.

C.18 Reporting Rules

  • State the dependent variable exactly.
  • State whether zero flows are included.
  • State which fixed effects are absorbed.
  • Do not report absorbed variables as if they were omitted accidentally.
  • Convert dummy coefficients with \(100 \times [\exp(\hat{\gamma}) - 1]\).
  • Use the same notation in tables, equations, and text.