Automated generation of synthetic MV distribution feeders

This section describes the methodology for automatically generating synthetic medium-voltage (MV) radial distribution feeders, based on the work of [Schweitzer et al., 2017]. The approach treats distribution feeders as tree graphs and identifies statistical patterns in their structural and electrical properties. These patterns are then exploited in a synthesis algorithm that generates feeders statistically similar to real-world data.

The core idea is that many node and edge properties in a radial feeder can be expressed as functions of the hop distance from the primary substation. This graph-centric perspective bridges Complex Network Science (CNS) metrics and power engineering design principles. A large dataset from a Dutch DSO (covering \(\sim 8200\,\text{km}^2\), \(\sim 100\) feeders) serves as the empirical basis.

Overview

The generation algorithm proceeds through five sequential steps, each governed by one or more statistical distributions identified from the data:

Node generation — determine the number of nodes and assign hop distances using a Negative Binomial distribution.
Feeder connection — connect nodes into a tree using a bimodal Gamma degree distribution, with degree clipping based on hop distance.
Node properties — assign load/generation to nodes using Beta, Poisson, Normal, and t-Location-Scale distributions.
Cable type assignment — select cable types from a library using the Exponential distribution for the current utilization ratio.
Cable length assignment — assign branch lengths using a modified Cauchy distribution, with hop-dependent length clipping.

The quality of synthesis is assessed using the KL-Divergence between distributions of the synthetic and real feeders.

Notation

The following conventions are used throughout:

Symbol	Meaning
\(N\)	Total number of nodes in a feeder
\(n\)	A single node object
\(M\)	Total number of branches in a feeder
\(m\)	A single branch object
\(n.h\)	Hop distance of node \(n\) from the HV source (number of edges along the unique path)
\(n.d\)	Degree of node \(n\) (number of incident branches)
\(n.P\), \(n.Q\)	Real and reactive power at node \(n\)
\(m.I_\text{est}\)	Estimated current in branch \(m\)
\(m.I_\text{nom}\)	Nominal (rated) current of the cable assigned to branch \(m\)
\(m.\ell\)	Length of branch \(m\)
\(\mathbb{1}(\cdot)\)	Indicator function (evaluates to 1 if the argument is true, 0 otherwise)
\(\mathcal{U}(0,1)\)	Uniform distribution on the unit interval

Properties are accessed via dot notation: e.g., \(n.P\) is the real power load at node \(n\).

Feeder definition and analysis

A feeder is defined as the section of the MV distribution system fed by a single primary substation MV bus, plus the HV source bus on the other side of the distribution transformer. The HV node is the source (\(n.h = 0\)), and the MV bus directly connected to it is the root (\(n.h = 1\)).

Feeders are identified from the full system graph \(G(V, E)\) by performing a Breadth First Search (BFS) from each MV neighbor of an HV source, excluding the HV source and its other neighbors. All nodes discovered in the BFS constitute the feeder.

Since distribution feeders are operated radially, the resulting subgraph is a tree. This has important consequences:

The unique path from any node to the source is unambiguous.
Hop distance \(n.h\) serves as a proxy for centrality measures (analogous to betweenness centrality in meshed networks).
Clustering coefficients are zero by definition.
Power flow can be estimated without knowing line parameters, since downstream power can be computed by summing loads along the tree.

An additional set of reduced feeders is obtained by merging nodes connected by negligible impedances (e.g., \(R = X = 1\,\mu\Omega\)). These are used for most of the analysis, as the distinction between a large busbar and two small busbars connected by near-zero impedance is immaterial for statistical purposes.

Verification methodology: KL-Divergence

The Kullback–Leibler (KL) Divergence is used throughout to quantify how well synthetic distributions match the real data:

\[D_{KL}(p \| q) = \int_{-\infty}^{\infty} p(x) \log \frac{p(x)}{q(x)} \, dx\]

where \(p(x)\) is the empirical distribution from the data and \(q(x)\) is the fitted model.

The KL-Divergence is operationally interpreted as follows: an observer trying to distinguish samples from \(p\) versus \(q\) will err with probability decaying exponentially in the number of observations, at a rate equal to \(D_{KL}\). A small \(D_{KL}\) means that synthetic samples are statistically indistinguishable from real data given a moderate sample size.

Meaningful ranges for \(D_{KL}\) are determined in two steps:

Functional law selection: considering aggregate (cumulative) data from all feeders for higher statistical relevance, the distribution with the lowest \(D_{KL}\) with respect to the data is selected.
Per-feeder range: the distribution of \(D_{KL}\) values between each individual feeder and the selected law provides a weighted range for the divergence.

Step 1: Node generation

The Negative Binomial distribution governs the assignment of hop distances:

\[f(x; r, p) = \frac{\Gamma(r + x)}{x!\,\Gamma(r)}\, p^r (1 - p)^x, \qquad x = 0, 1, 2, \ldots\]

where \(r > 0\) and \(0 \leq p \leq 1\). The Negative Binomial is interpreted as an over-dispersed Poisson: the variance exceeds the mean, but a mean and variance are sufficient to describe the process. This models the random process of “how far a node is from the source.”

Each node also receives a power factor from an empirical CDF extracted from the data, simplifying later computations to focus on real power only.

Step 2: Feeder connection

Procedure: ConnectNodes

Inputs: Mixture Gamma distribution parameters, maximum degree clipping function \(g_{d_\text{max}}(h)\).

All leaf nodes (at \(\max(h)\)) and the source (\(h = 0\)) are assigned degree 1.
The root (\(h = 1\)) has its degree determined deterministically by the number of nodes at \(h = 2\) plus one (for the source connection).
For each remaining node \(n\):
1. Sample a degree \(d_\text{tmp}\) from the mixture Gamma distribution.
2. Resample until \(d_\text{tmp} \leq g_{d_\text{max}}(n.h)\).
3. Assign \(n.d^* \leftarrow d_\text{tmp}\).
Sort nodes in ascending order of \(h\).
Connect from the furthest nodes toward the source: for each node \(n\) (from \(N\) down to 2), connect to the predecessor \(p\) (where \(p.h = n.h - 1\)) for which \(p.d - p.d^*\) is most negative (i.e., the predecessor whose actual degree is furthest below its target).

The degree distribution of the feeders is fit by a mixture of two Gamma distributions:

\[f(x; \pi, a_1, b_1, a_2, b_2) = \pi \cdot g(x; a_1, b_1) + (1 - \pi) \cdot g(x; a_2, b_2)\]

where:

\[g(x; a, b) = \frac{1}{b^a \, \Gamma(a)}\, x^{a-1} e^{-x/b}, \qquad x > 0\]

The bimodal Gamma reflects two clearly distinct decay rates observed in the empirical degree distribution. This is consistent with findings in the literature that exponential or sum-of-exponential distributions fit the degree distribution of distribution networks, with the Gamma being the conjugate prior of the Exponential.

The radial tree structure imposes deterministic constraints: leaf nodes have degree 1, the source has degree 1, and the root’s degree is fully determined by the number of nodes at \(h = 2\). The connection algorithm works bottom-up (from leaves toward root), always picking the predecessor whose actual degree most needs to increase.

Step 3: Node properties (load and generation)

Nodes are classified into three types: intermediate (no load), generation (negative load / power injection), and consumption (positive load). Each is handled by a separate sub-procedure.

Intermediate nodes

The Beta distribution models the fraction of zero-load nodes:

\[f(x; \alpha, \beta) = \frac{1}{B(\alpha, \beta)}\, x^{\alpha - 1}(1 - x)^{\beta - 1}, \qquad 0 < x < 1\]

where \(B(\cdot)\) is the Beta function. The Beta is a natural choice for modeling fractional quantities bounded between 0 and 1.

The hop distance of each intermediate node follows a mixture Poisson distribution:

\[f(x; \pi, \mu_1, \mu_2) = \pi \frac{\mu_1^x}{x!} e^{-\mu_1} + (1 - \pi) \frac{\mu_2^x}{x!} e^{-\mu_2}, \qquad x = 0, 1, 2, \ldots\]

The bimodal Poisson reflects the physical observation that junction nodes (no load) occur predominantly close to the primary substation, where main sub-feeders separate, with a secondary mode one-third to halfway down the feeder, reflecting further geographical splitting or voltage-level transitions.

Power injection (generation) nodes

The normalized hop distance (i.e., \(h / h_\text{max}\)) of injection nodes follows a mixture of two Normal distributions:

\[f(x; \pi, \mu_1, \sigma_1, \mu_2, \sigma_2) = \pi \cdot \mathcal{N}(x; \mu_1, \sigma_1) + (1 - \pi) \cdot \mathcal{N}(x; \mu_2, \sigma_2)\]

The main mode is close to the substation (small generators, PV installations, wind turbines), with a smaller bump further down the feeder (LV feeders feeding power back).

The deviation of each injection from the uniform distribution \(1/N_\text{inj}\) is:

\[\epsilon = \frac{n.P_\text{inj}}{\sum_{n} n.P_\text{inj}} - \frac{1}{N_\text{inj}}\]

which is found to be Normally distributed.

Positive load (consumption) nodes

The fundamental design principle is that utilities attempt to distribute load evenly across a feeder. The deviation from uniform distribution:

\[\epsilon = \frac{n.P}{\sum_n n.P} - \frac{1}{N}\]

is tightly centered around zero and follows a t-Location-Scale distribution:

\[f(x; \mu, \sigma, \nu) = \frac{\Gamma\!\left(\frac{\nu+1}{2}\right)}{\sigma\sqrt{\nu\pi}\,\Gamma\!\left(\frac{\nu}{2}\right)} \left(1 + \frac{1}{\nu}\left(\frac{x - \mu}{\sigma}\right)^2\right)^{-\frac{\nu+1}{2}}\]

where \(\sigma, \nu > 0\). In the limit \(\nu = 1\), this reduces to the Cauchy distribution. The empirical fit parameters indicate near-Cauchy behavior, reflecting heavy tails: most loads are close to the uniform value, but occasional significant deviations occur.

Step 4: Cable type assignment

The estimated current in each branch is computed without a full power flow, exploiting the radial tree structure:

\[m.I_\text{est} = \frac{m.S_\text{downstream}}{\sqrt{3}\, m.V_\text{nom}}\]

where \(m.S_\text{downstream}\) is the apparent power of all downstream nodes. This is the key advantage of the radial assumption: power flow is computable from topology alone.

The ratio \(I_\text{est}/I_\text{nom}\) follows an Exponential distribution:

\[f(x; \mu) = \frac{1}{\mu}\, e^{-x/\mu}, \qquad x \geq 0\]

The cable library is built from the data via k-means clustering on nominal current. It contains per-distance impedance parameters and the empirical frequency of each cable type, which is used to weight the selection.

Step 5: Cable length assignment

The modified Cauchy distribution is identified by observing a clear exponential decay in the magnitude of the empirical characteristic function (Fourier transform of the length histogram):

\[\varphi_x(t; x_0, \gamma) = e^{j x_0 t - \gamma |t|}\]

Only the Cauchy distribution exhibits such a decay, leading to the fit:

\[f(x; x_0, \gamma) = \left[\arctan\!\left(\frac{x_0}{\gamma}\right) + \frac{\pi}{2}\right]^{-1} \frac{\gamma}{(x - x_0)^2 + \gamma^2}\]

where \(x_0 \in \mathbb{R}\), \(\gamma > 0\), and the distribution is modified to have support \(x > 0\) only.

Clipping distributions

Since several of the fitted distributions have heavy tails or unbounded support, extreme samples can violate physical or engineering constraints. Clipping functions, all expressed in terms of hop distance \(n.h\), are applied to bound the sampled values.

Maximum degree clipping

Branching should decrease with distance from the substation. The maximum degree at each hop level follows a power law:

\[g_{d_\text{max}}(h) = a \cdot h^b\]

During degree assignment, the bimodal Gamma is resampled until \(d \leq g_{d_\text{max}}(h)\).

Maximum nominal current clipping

Larger (more expensive) cables are not used far from the substation. A threshold is applied: for \(h \geq 8\), the nominal current is restricted to \(I_\text{nom} \leq 450\,\text{A}\).

Maximum cable length clipping

Voltage drop constraints physically limit conductor length. The maximum length at each hop distance follows an exponential function:

\[g_\text{max}(h) = a \cdot e^{b \cdot h}\]

The Cauchy distribution is resampled until \(\ell \leq g_\text{max}(h)\).

The effect of clipping is to condition the distribution: \(f(x) \to f(x \,|\, x < x_\text{max}(h))\). This redistributes probability mass from outside the constrained domain into the domain, with the support depending on \(h\). These clipping trends are directly observed in the real data.

Emergent validation distributions

Two key distributions emerge naturally from the synthesis without being explicitly modeled, providing strong validation:

Downstream power distribution

The downstream power of a node \(n_i\) is the sum of all real power flowing past it toward the leaves:

\[n_i.P_\text{downstream} = \sum_{n_j \leftarrow n_i} n_j.P\]

where \(n_j \leftarrow n_i\) denotes all nodes downstream of \(n_i\). When normalized by total feeder load, this follows a Generalized Pareto distribution:

\[f(x; k, \sigma, \theta) = \frac{1}{\sigma}\left(1 + k \cdot \frac{x - \theta}{\sigma}\right)^{-1 - 1/k}\]

where \(x > \theta\) and \(k > 0\).

Per-unit voltage drop distribution

The estimated voltage drop across each branch, expressed as a fraction of nominal voltage:

\[m.\Delta V = \frac{m.I_\text{est} \cdot m.Z}{m.V_\text{nom}}\]

where \(m.Z\) is computed from the per-distance cable parameters and length \(m.\ell\). This also follows a Generalized Pareto distribution. The fact that these distributions emerge from the interplay of the individually modeled properties indicates that the complex interactions within the feeder are correctly captured.

Algorithm inputs and evaluation

The generation algorithm requires three scalar inputs:

\(N\) — number of nodes
Total load (MVA)
Total generation (MVA)

These are sampled from a three-dimensional Kernel Density Estimate (KDE) fitted to the real data vector \((N, \text{Load}, \text{Generation})\).

Performance is evaluated by generating a large ensemble (e.g., 427 synthetic feeders) and comparing distributions across all properties. When inputs are similar to the training data, the \(D_{KL}\) values remain low. When inputs deviate (e.g., doubled load), \(D_{KL}\) increases by roughly an order of magnitude, thus serving as an indicator of how “extreme” a generated case is relative to the data.

Summary of statistical distributions

Property	Distribution	Equation
Hop distance	Negative Binomial	\(f(x; r, p) = \frac{\Gamma(r+x)}{x!\,\Gamma(r)} p^r (1-p)^x\)
Degree	Mixture Gamma	\(f(x) = \pi g(x; a_1, b_1) + (1-\pi) g(x; a_2, b_2)\)
Fraction of intermediate nodes	Beta	\(f(x; \alpha, \beta) = \frac{x^{\alpha-1}(1-x)^{\beta-1}}{B(\alpha,\beta)}\)
Hop of intermediate nodes	Mixture Poisson	\(f(x) = \pi \frac{\mu_1^x e^{-\mu_1}}{x!} + (1-\pi) \frac{\mu_2^x e^{-\mu_2}}{x!}\)
Fraction of injection nodes	Beta	(same form as above)
Normalized hop of injection nodes	Mixture Normal	\(f(x) = \pi \mathcal{N}(x;\mu_1,\sigma_1) + (1-\pi) \mathcal{N}(x;\mu_2,\sigma_2)\)
Injection power deviation	Normal	\(\mathcal{N}(\mu, \sigma)\)
Load deviation	t-Location-Scale	\(f(x;\mu,\sigma,\nu) \propto (1 + (x-\mu)^2/(\nu\sigma^2))^{-(\nu+1)/2}\)
Current ratio \(I_\text{est}/I_\text{nom}\)	Exponential	\(f(x;\mu) = \mu^{-1} e^{-x/\mu}\)
Cable length	Modified Cauchy	\(f(x; x_0, \gamma) \propto \gamma / [(x - x_0)^2 + \gamma^2]\)
Max degree (clipping)	Power law	\(g_{d_\text{max}}(h) = a \cdot h^b\)
Max length (clipping)	Exponential function	\(g_\text{max}(h) = a \cdot e^{b \cdot h}\)
Downstream power (emergent)	Generalized Pareto	\(f(x;k,\sigma,\theta) = \sigma^{-1}(1+k(x-\theta)/\sigma)^{-1-1/k}\)
Voltage drop (emergent)	Generalized Pareto	(same form as above)

Generality and future directions

The authors note that while the distributions were calibrated on a Dutch DSO dataset, the methodology is general. Preliminary analysis of two U.S. feeders (IEEE 8500-bus and a California utility feeder) shows that similar statistical laws hold with different parameter values. The modular structure of the algorithm allows distributions to be swapped or reparametrized for different regions without changing the construction logic.

Current omissions acknowledged:

Transformers are minimal in the single-feeder model (only the HV/MV transformer). Future multi-voltage-level models will incorporate them via a library approach.
Capacitors are absent from the Dutch dataset (underground cables provide natural capacitance), but would need inclusion for other regions.
Meshed/reconfigurable topologies: the algorithm focuses on radial feeders. Normally-open switches enabling reconfiguration, and the cycle distribution of the full system, are left for future work.

The increasing \(D_{KL}\) when inputs deviate from the training data provides a built-in metric for assessing how “normal” a generated feeder is, enabling quality control in Monte Carlo applications.

Software implementation

The synthesis algorithm is implemented in the powergrid_synth.distribution subpackage.

Graph classes. Generated feeders can be wrapped as DistributionGrid objects, a subclass of PowerGridGraph (itself a networkx.Graph subclass). DistributionGrid provides convenience properties for radial tree grids:

root — the node at hop distance 0
max_hop — maximum hop distance in the feeder
is_radial — whether the graph is a connected tree
total_load_mw / total_gen_mw — aggregate power quantities
nodes_at_hop(h) / nodes_by_type(t) — node selection helpers

The companion TransmissionGrid class provides analogous helpers for meshed, multi-voltage-level transmission networks (voltage_levels, n_levels).

Reference grid conversion. To calibrate the algorithm from a real distribution grid rather than the default Table III parameters, the module distribution_converter provides:

pandapower_to_feeders() — converts a pandapower network to Schweitzer-format feeder graphs, handling lines, transformers, and closed bus–bus switches.
pypowsybl_to_feeders() — converts a pypowsybl Network (loaded from CGMES, XIIDM, MATPOWER, PSS/E, etc.) to the same Schweitzer-format feeder graphs.
feeder_summary() — returns a summary dict per feeder.

For transmission grids loaded from industry-standard formats, load_grid() and pypowsybl_to_nx() provide direct conversion from any pypowsybl-supported file format to a NetworkX graph compatible with the synthesis pipeline.

The fitted parameters can be fed directly to SchweetzerFeederGenerator for synthesis. See the DistributionSynthFromRef example notebook for a complete workflow.

[SSMP17]

Eran Schweitzer, Anna Scaglione, Antonello Monti, and Giuliano Andrea Pagani. Automated generation algorithm for synthetic medium voltage radial distribution systems. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 7(2):271–284, 2017. doi:10.1109/JETCAS.2017.2682934.