GN Model Physical Layer

This page describes the physical layer model used by XLRON's GN model environments (rsa_gn_model and rmsa_gn_model). It covers the modelling assumptions, noise sources, how ROADM nodes are treated, and what each configuration parameter controls.

The implementation is based on the closed-form ISRS GN model from:

D. Semrau, R. I. Killey, P. Bayvel, "A Closed-Form Approximation of the Gaussian Noise Model in the Presence of Inter-Channel Stimulated Raman Scattering," J. Lightw. Technol., vol. 37, no. 9, pp. 1924-1936, May 2019.

Overview of the Model

The GN model computes the signal-to-noise ratio (SNR) for each frequency slot on each fibre link in the network. The total noise on a channel is the sum of four independent contributions:

P_noise = P_ASE_inline + P_ASE_ROADM + P_NLI + P_TRX

Noise source	Symbol	Origin
Inline amplifier ASE	P_ASE_inline	Erbium-doped fibre amplifier (EDFA) spontaneous emission at each span
ROADM ASE	P_ASE_ROADM	Booster amplifiers at express and add/drop ROADM nodes
Nonlinear interference	P_NLI	Kerr effect in the fibre (SPM + XPM, with ISRS correction)
Transceiver noise	P_TRX	Back-to-back transceiver SNR limit

The per-channel SNR on a single link is then:

SNR_link = P_signal / P_noise

For a multi-link path, noise-to-signal ratios are summed across links (independent noise sources add in power):

1/SNR_path = sum over links on path of (1/SNR_link) + ROADM_ASE/P_signal

Note that inline ASE and NLI are computed per link, while ROADM ASE is computed per path (since it depends on the number of intermediate nodes traversed).

Link Model: Fibre Spans and Inline Amplifiers

Each link in the topology is divided into fibre spans. Topology files store link distances in km; the environment converts these to metres and divides each link into equal-length spans:

num_spans = ceil(link_length_m / max_span_length)
span_length = link_length_m / num_spans

The default max_span_length is 100 km. For example, a 350 km link has 4 spans of 87.5 km each.

Each span has an inline EDFA at the output that compensates for fibre loss. The amplifier gain accounts for both fibre attenuation and the ISRS (Inter-channel Stimulated Raman Scattering) Raman tilt, which causes higher-frequency channels to transfer power to lower-frequency channels during propagation. The ISRS-aware gain per channel is:

G_i = exp(alpha * L) / g_SRS_i

where g_SRS_i corrects for the frequency-dependent power transfer due to Raman scattering.

Fibre Attenuation

Fibre loss is characterised by the attenuation coefficient alpha in Nepers per metre. The default is 0.2 dB/km, which converts to approximately 4.605 x 10^-5 Np/m. The attenuation_bar parameter (mean attenuation, used internally for ISRS tilt calculations) defaults to the same value.

Inline Amplifier ASE Noise

Each inline EDFA adds ASE noise according to:

P_ASE = 2 * N_sp * (G - 1) * h * f * B

where:

N_sp = (NF_lin * G) / (2 * (G - 1)) is the spontaneous emission factor
NF_lin = 10^(NF_dB / 10) is the amplifier noise figure in linear units
G is the ISRS-aware amplifier gain
h is Planck's constant (6.626 x 10^-34 J.s)
f is the absolute channel frequency in Hz
B is the channel bandwidth in Hz
The factor of 2 accounts for both polarisation modes

The total inline ASE on a link is the sum over all spans: P_ASE_inline = num_spans * P_ASE_per_span.

The amplifier noise figure can vary across the spectrum. XLRON supports per-slot NF values loaded from a CSV file that maps spectral bands to noise figure and transceiver SNR values (see Spectral Band Data below).

Nonlinear Interference (NLI)

NLI arises from the Kerr effect in the fibre and has two components:

SPM (Self-Phase Modulation): a channel interferes with itself
XPM (Cross-Phase Modulation): other channels create interference through inter-channel nonlinear mixing

The GN model computes NLI power as:

P_NLI_i = P_i^3 * eta_n_i

where eta_n_i = eta_SPM_i + eta_XPM_i is the NLI efficiency coefficient, which depends on fibre parameters (gamma, beta_2, beta_3, alpha), channel spacing, channel bandwidths, and channel powers.

ISRS correction: The model incorporates the effect of stimulated Raman scattering on NLI. The Raman gain slope C_r modifies the effective attenuation seen by each channel, which in turn affects NLI efficiency. This means that NLI is not just a function of local channel properties but also depends on the total power and frequency distribution of all channels on the link.

Coherent vs incoherent accumulation: NLI from successive spans can accumulate either coherently (phased) or incoherently (power-summed), controlled by the --coherent flag. The coherence factor epsilon determines the scaling:

Incoherent (--coherent=False, default): SPM scales linearly with span count, XPM scales linearly
Coherent (--coherent=True): SPM scales as num_spans^(1 + epsilon) where epsilon > 0, producing slightly higher NLI

In practice, incoherent accumulation is a reasonable approximation for most fibre types and is the default.

Uniform vs non-uniform spans: When --uniform_spans=True (default), all spans on a link are assumed to have equal length, enabling a fast closed-form computation. When --uniform_spans=False, a jax.lax.scan loop iterates over spans with potentially different lengths.

Modulation Format Correction

When --mod_format_correction=True, the NLI calculation includes correction terms that account for the non-Gaussian statistics of specific modulation formats. Each format has an excess_kurtosis value (e.g., BPSK = -1, 16QAM = -0.68) that modifies the XPM contribution. Formats closer to Gaussian (kurtosis = -1) produce less NLI correction. This is only relevant for the rmsa_gn_model environment where modulation formats are explicitly tracked per channel.

Transceiver Noise

The transceiver SNR represents the back-to-back performance limit of the transmitter and receiver, independent of fibre propagation. It is modelled as an additive noise term:

P_TRX = P_signal / SNR_TRX_linear

Like the amplifier noise figure, the transceiver SNR can vary across the spectrum via the per-band CSV data file.

ROADM Node Model

ROADM (Reconfigurable Optical Add-Drop Multiplexer) nodes are modelled with separate express and add/drop loss parameters. Each ROADM contains a booster amplifier that compensates for the ROADM insertion loss, and this amplifier adds ASE noise.

For a lightpath traversing a path with N_links links:

Source node: 1 add ROADM with loss roadm_add_drop_loss (default 8 dB)
Intermediate nodes: N_links - 1 express ROADMs with loss roadm_express_loss (default 5 dB) each
Destination node: 1 drop ROADM with loss roadm_add_drop_loss (default 8 dB)
All ROADM amplifiers: noise figure roadm_noise_figure (default 5 dB)

The ROADM ASE noise is computed per path (not per link) because the number of intermediate nodes depends on the end-to-end route. The formula follows the same ASE calculation as inline amplifiers, but using the ROADM loss as the gain to be compensated:

G_express = 10^(roadm_express_loss / 10)
G_add_drop = 10^(roadm_add_drop_loss / 10)

P_ROADM_ASE = N_intermediate * ASE(G_express, NF_roadm) + 2 * ASE(G_add_drop, NF_roadm)

Power Budget Enforcement

The total optical power on each fibre link is constrained by max_power_per_fibre (default 21 dBm). When a new channel would cause the total power on any link along its path to exceed this limit, the action is rejected:

During masking: candidate placements that would exceed the power limit are masked out
During action checking: the power constraint is verified after tentative placement

The per-channel launch power is set by power_per_channel (in dBm). If not specified, it defaults to max_power_per_fibre divided equally among all slots. For example, with max_power_per_fibre=21 dBm (125.9 mW) and link_resources=100, the default per-channel power is approximately 1.26 mW (1.0 dBm).

Spectral Band Data

The amplifier noise figure and transceiver SNR can vary across the optical spectrum. XLRON loads per-band values from a CSV file (transceiver_amplifier_data.csv) that defines spectral bands with associated NF and transceiver SNR. Each frequency slot is assigned the values of the band it falls within.

The default data covers five bands spanning the partial S-band, C-band, and L-band (approximately 1485-1625 nm):

Band	Wavelength range	Amplifier NF	Transceiver SNR
Partial S-band	1485-1520 nm	7.0 dB	15.8 dB
S/C transition	1520-1529 nm	9.0 dB	17.8 dB
C-band	1529-1568 nm	5.5 dB	21.2 dB
L-band (main)	1568-1608 nm	6.0 dB	21.2 dB
L-band (edge)	1608-1625 nm	9.0 dB	17.1 dB

The C-band has the best amplifier performance (lowest NF) and transceiver performance (highest SNR), reflecting the maturity of C-band EDFA technology.

Band boundaries for inter-band gap enforcement are set by default (--enforce_band_gaps) and are defined in a separate file (band_data.csv), which specifies the standard optical bands (O, E, S, C, L, U) and their frequency ranges. This can be overridden with --band_data_filepath.

Band Preference for Heuristic Slot Allocation

When using first-fit or last-fit heuristics with a GN model environment, the --band_preference flag controls the order in which optical bands are filled. By default, slots are allocated in raw index order (i.e. by frequency). With --band_preference, slots in the most-preferred band are exhausted before moving to the next band.

For example, --band_preference=C,L will fill C-band slots first, then L-band slots. This is useful for multi-band scenarios where operators want to prioritise certain bands (e.g. fill C-band before spilling into L-band).

python -m xlron.train.train \
  --env_type=rsa_gn_model \
  --topology_name=nsfnet_deeprmsa_directed \
  --link_resources=100 --k=5 --load=250 \
  --continuous_operation --ENV_WARMUP_STEPS=3000 \
  --TOTAL_TIMESTEPS=100000 --NUM_ENVS=1 \
  --EVAL_HEURISTIC --path_heuristic=ksp_ff \
  --band_preference=C,L

The preference string is a comma-separated list of band names (matching the band_name column in band_data.csv). Bands not listed are appended in CSV order after the specified ones. The flag affects both first-fit and last-fit heuristics; for last-fit, slots within each band are filled from the high end first, but bands are still tried in preference order.

Modulation Formats

For rmsa_gn_model, modulation formats are loaded from a CSV file. Each format specifies:

Column	Description
`name`	Format name (e.g., QPSK, 16QAM)
`maximum_length`	Maximum optical reach in km (used for path pruning)
`spectral_efficiency`	Bits per symbol per polarisation (determines required slots)
`minimum_osnr`	Minimum required SNR threshold in dB
`inband_xt`	Inband crosstalk tolerance in dB
`excess_kurtosis`	Excess kurtosis for modulation format NLI correction

The number of slots required for a request is ceil(requested_bandwidth / (slot_size * spectral_efficiency)).

The SNR margin parameter (--snr_margin, default 1 dB) is added to the minimum_osnr threshold when checking whether a channel's SNR is sufficient.

Default formats for the GN model environments (from modulations_gn_model.csv):

Format	SE (b/s/Hz)	SNR threshold	Excess kurtosis
BPSK	1	12.6 dB	-1.00
QPSK	2	12.6 dB	-1.00
8QAM	3	18.6 dB	-0.82
16QAM	4	22.4 dB	-0.68
32QAM	5	26.4 dB	-0.52
64QAM	6	30.4 dB	-0.32

Calculating SNR Thresholds from Spectral Efficiency

Instead of using pre-specified minimum_osnr values from the CSV, the GSNR threshold can be calculated analytically from the modulation order using the --calc_minimum_osnr flag. When enabled, the minimum_osnr column is overwritten with values computed from:

GSNR_th(m') = f(m') * erfc_inv(g(m', beta_FEC))

where m' = spectral_efficiency is the modulation level (so that M = 2^m'), beta_FEC is the pre-FEC BER target (--beta_fec, default 1.5e-2), and erfc_inv is the inverse complementary error function. The formula has three cases:

m' in {1, 2} (BPSK, QPSK): GSNR_th = m' * erfc_inv(2 * beta_FEC)
m' = 3 (8QAM): GSNR_th = (2(M-1)/3) * erfc_inv(1.5 * beta_FEC)
m' in {4, 5, 6} (16QAM, 32QAM, 64QAM): GSNR_th = (2(M-1)/3) * erfc_inv(m' * beta_FEC / (2(1 - 1/sqrt(M))))

The result is converted to dB: GSNR_th_dB = 10 * log10(GSNR_th).

With the default beta_fec=1.5e-2, the calculated thresholds are:

Format	SE	Calculated GSNR threshold
BPSK	1	1.9 dB
QPSK	2	4.9 dB
8QAM	3	8.8 dB
16QAM	4	11.6 dB
32QAM	5	14.7 dB
64QAM	6	17.6 dB

Path-Level SNR Computation

The per-link SNR values (computed as described above) are combined into a path-level SNR for each frequency slot:

For each link on the path, convert link SNR to NSR (noise-to-signal ratio): NSR_link = 1 / SNR_link
Sum NSRs across all links on the path (independent noise sources add in power)
Add ROADM ASE contribution as P_ROADM_ASE / P_signal for occupied slots
Convert back to SNR in dB: SNR_path_dB = 10 * log10(1 / NSR_total)

This path-level SNR is what gets compared against modulation format thresholds.

Nyquist Subchannel Modelling (`num_subchannels`)

Motivation

A wideband optical channel (e.g. 100 GHz / 100 GBd) can be implemented as multiple narrower Nyquist subchannels (e.g. 8 x 12.5 GHz / 12.5 GBd subcarriers). The lower effective baud rate per subchannel reduces the self-phase modulation (SPM) nonlinear interference, because SPM depends on the square of the channel bandwidth through the arcsinh term in the GN model formula.

Without num_subchannels, the only way to model this effect would be to set slot_size=12.5 and use 8 slots per channel. But this inflates the number of frequency slots per link, the action space, and the O(N^2) XPM computation. The --num_subchannels parameter avoids this cost by analytically correcting the SPM term.

How It Works

The --num_subchannels flag (default 1) divides each slot's bandwidth into N Nyquist subchannels for the purpose of SPM calculation only:

Effective bandwidth: B_eff = slot_size / num_subchannels
SPM eta is computed using B_eff in place of slot_size in the GN model formula (both in the main arcsinh terms and in the coherence factor epsilon)
Power scaling: Each subchannel carries P/N power, so total NLI from N subchannels is N * (P/N)^3 * eta(B_eff) = P^3 * eta(B_eff) / N^2. The SPM efficiency is therefore divided by num_subchannels^2.
XPM: Unchanged. Cross-phase modulation between different frequency slots uses the physical slot bandwidth.
ASE: Unchanged. The noise bandwidth is the physical slot bandwidth.
Backward compatible: num_subchannels=1 gives B_eff = slot_size and scaling = 1, producing identical results to the default.

Best Practice

For Nyquist subchannel modelling, set slot_size equal to the desired channel bandwidth and num_subchannels equal to the number of subcarriers. For example, to model 100 GBd channels with 8 x 12.5 GBd subcarriers:

python -m xlron.train.train \
  --env_type=rsa_gn_model \
  --topology_name=nsfnet_deeprmsa_directed \
  --slot_size=100 --num_subchannels=8 \
  --link_resources=50 --k=5 --load=250 \
  --continuous_operation --ENV_WARMUP_STEPS=3000 \
  --TOTAL_TIMESTEPS=100000 --NUM_ENVS=1 \
  --EVAL_HEURISTIC --path_heuristic=ksp_ff

Worked Example: Per-Slot State Arrays

Consider a 400 Gbps request with spectral efficiency 2 b/s/Hz, slot_size=100 GHz, guardband=1, and num_subchannels=8.

The number of required slots is ceil(400 / (2 * 100)) + guardband = 2 + 1 = 3. Suppose the request is placed at initial slot index 5 on a link, occupying slots 5, 6, and 7.

The per-slot state arrays on that link are updated as follows:

Array	Slot 5	Slot 6	Slot 7	Notes
`link_slot_array`	1	1	1	Occupied (was 0)
`channel_centre_bw_array`	100	100	100	Always `slot_size` in GHz
`channel_power_array`	P	P	P	Launch power in watts (same for all slots in the channel)
`channel_centre_freq_array`	f_c	f_c	f_c	Centre frequency of the 3-slot block in GHz (midpoint of slot 5 and slot 7)
`path_index_array`	idx	idx	idx	Lightpath index identifying the path

Key observations:

channel_centre_bw_array stores slot_size (100 GHz) for every occupied slot, not the total channel bandwidth (300 GHz). The GN model sees 3 independent 100 GHz channels.
channel_centre_freq_array stores the same centre frequency for all 3 slots -- the midpoint of the block. This is used for XPM calculations between different channels.
channel_power_array stores the per-channel launch power. Each slot is treated as an independent channel with full launch power.

With num_subchannels=8, the GN model computes SPM for each slot as if its 100 GHz bandwidth were divided into 8 x 12.5 GHz subchannels (B_eff = 12.5 GHz), with eta_spm scaled by 1/64. XPM between the three slots (and between these slots and other channels on the link) is computed using the physical 100 GHz bandwidth, unchanged.

Centre Frequency Caching

The channel_centre_freq_array is maintained as part of the environment state to avoid recomputing centre frequencies on every SNR calculation. It is:

Initialised to zeros on environment reset
Set when a lightpath is placed (same value written to all slots in the channel)
Cleared when a lightpath expires (multiplied by the expiry mask, same pattern as other per-slot arrays)
Restored from a previous snapshot (channel_centre_freq_array_prev) if an action is rolled back due to a failed SNR or power check

This caching is especially beneficial in rmsa_gn_model, where the masking step evaluates the GN model for many candidate placements.

`rsa_gn_model` Environment

The rsa_gn_model environment performs Routing and Spectrum Assignment with the GN model providing physical layer awareness, but without per-step modulation-format-aware masking.

Intended Use

Throughput capacity studies: measure network-level Shannon throughput under realistic physical layer constraints
Fast simulations: action masking uses standard RSA slot-availability checks (no GN model evaluation during masking), making it significantly faster than rmsa_gn_model
Scenarios with predetermined modulation: when modulation format is fixed or determined by path distance rather than real-time SNR

How It Works

Action masking: uses the standard RSA mask (contiguous free-slot check). No GN model evaluation during masking.
Action execution: places the lightpath and updates channel_power_array, channel_centre_bw_array, and path_index_array on the affected links.
SNR update: after each step, link_snr_array is recomputed for all links using the GN model.
Action check: check_action_rmsa_gn_model verifies that all active lightpaths still meet a basic SNR threshold and that the power budget is not exceeded. If the check fails, the placement is rolled back.
Throughput measurement: at episode end, in the RSA GN Model env (RMSA GN env already tracks bitrate), computes Shannon-Hartley throughput for all active lightpaths:

throughput_per_LP = log2(1 + SNR_linear) * slot_size_GHz * 2 * (1 - FEC_overhead)

The factor of 2 is for dual polarisation. The default FEC overhead is 28% (--fec_threshold=0.28).

Observation Space

The observation includes the request (source, destination, bandwidth, holding time) plus per-path statistics:

Mean free block size and free slot count
Path length in hops and distance
Number of active connections, mean power, and mean SNR of connections on each path

When to Use

Use rsa_gn_model when you want physically-aware throughput measurement but don't need the GN model to be evaluated at every masking step. This is appropriate for studying how routing and spectrum assignment strategies affect achievable throughput, or for large-scale simulations where the per-step cost of full GN model masking would be prohibitive.

`rmsa_gn_model` Environment

The rmsa_gn_model environment performs Routing, Modulation and Spectrum Assignment with full GN model evaluation during action masking. This is the most physically realistic environment in XLRON.

Intended Use

Physically realistic RL training: the agent sees only genuinely feasible actions, learning to make decisions that respect nonlinear interference constraints
Modulation-adaptive networking: the environment automatically selects the best modulation format for each placement based on current network conditions
Studying NLI-aware resource allocation: understanding how channel placement affects neighbouring channels through nonlinear interference

How It Works

Action masking (the key difference from rsa_gn_model): for each candidate combination of (path, modulation format, slot position), the mask function:
- Tentatively places the channel with the candidate's power and bandwidth
- Runs the full ISRS GN model to recompute SNR across all affected links
- Checks that the new channel meets its modulation format's SNR threshold
- Checks that all existing channels still meet their respective thresholds
- Checks that the total power on each link does not exceed max_power_per_fibre
- Only marks the action as valid if all checks pass
Candidate evaluation strategy: to keep the candidate count manageable, only the first-fit (FF) and last-fit (LF) slot positions are evaluated per (path, modulation format) pair, giving 2 * k * M candidates total (where k = number of paths, M = number of modulation formats). All candidates are evaluated in parallel using jax.vmap.
Modulation format selection: the mask stores the winning modulation format index for each valid slot position in mod_format_mask. When an action is taken, the environment looks up the pre-computed modulation format rather than re-evaluating.
Action execution: places the lightpath with the selected modulation format's spectral efficiency (determining slot count) and the configured launch power. Updates all per-slot state arrays.
Action check: same as rsa_gn_model -- verifies SNR sufficiency, RSA validity, and power budget.

Performance Considerations

The masking step is computationally expensive because it runs the GN model for every candidate. Two optimisations are available:

get_snr_link_array_fused (used when --uniform_spans=True and --mod_format_correction=False): a fully inlined version that reduces XLA operations by ~41-48% compared to the standard version
FF/LF only: evaluating only first-fit and last-fit positions (rather than all free slots) keeps the candidate count at O(k * M) rather than O(k * M * link_resources)

FEC Code Rate

The --fec_rate parameter (default 0.8) models the overhead introduced by forward error correction. When a request is successfully accepted, the bitrate counted towards accepted_bitrate is scaled by this factor:

accepted_bitrate += requested_datarate * fec_rate

This reflects the fact that a fraction (1 - fec_rate) of the transmitted symbols carry FEC redundancy rather than user data. For example, with fec_rate=0.8, a 100 Gbit/s request contributes 80 Gbit/s of effective user throughput to the accepted_bitrate metric.

This parameter only applies to rmsa_gn_model.

Metrics

The rmsa_gn_model environment tracks:

accepted_services: count of successfully routed requests
accepted_bitrate: cumulative effective bandwidth of accepted requests (scaled by fec_rate)
Blocking probability: fraction of requests that could not be served

It does not compute Shannon throughput (unlike rsa_gn_model).

Distributed Raman Amplification (DRA)

XLRON supports an optional Distributed Raman Amplification (DRA) model that replaces the EDFA-only ISRS NLI calculation with a Raman-pump-aware model. When enabled via --use_raman_amp, the nonlinear interference is computed using a 9-mode combination approach that accounts for the frequency-dependent Raman gain profile created by co- and counter-propagating pump lasers.

Physics Model

In a Raman-amplified span, pump lasers inject power into the fibre at frequencies offset from the signal band. Through stimulated Raman scattering, pump power is transferred to the signal channels, providing distributed gain along the fibre rather than lumped gain at span boundaries. This changes the signal power profile along the fibre, which in turn modifies the nonlinear interference.

The DRA NLI model computes the effective power profile using fit parameters [C_f, a_f, C_b, a_b, a] that describe the forward pump contribution, backward pump contribution, and fibre attenuation. These parameters are used to evaluate 9 mode combinations (l_1, l_2, l_1', l_2') of forward/backward Raman directions, each contributing to the total NLI efficiency coefficient.

The model supports both SPM and XPM contributions with coherent accumulation across spans, including correction terms for correlated inter-channel effects.

Triangular Raman Approximation

The Raman gain spectrum g_R(delta_f) of silica fibre has a broad, irregular shape peaking around 13 THz offset. The triangular Raman approximation simplifies this to a linear function of frequency offset:

g_R(delta_f) = C_r * |delta_f|

where C_r is the Raman gain slope (--raman_gain_slope), a single parameter that captures the strength of the Raman interaction. C_r has units of 1/(WmHz) and characterises how much power is transferred per unit frequency offset, per watt of pump power, per metre of fibre. It combines the intrinsic Raman scattering cross-section of the silica glass with the fibre effective area.

Typical values:

Standard single-mode fibre (SMF-28): C_r ~ 2.8e-17 1/(WmHz) (the default)
Higher-nonlinearity fibres or effective values used for wideband models may be larger

The triangular approximation is valid within approximately 15 THz of modulated bandwidth. When DRA is enabled, compute_band_layout automatically trims bands to fit within --raman_max_bandwidth_thz (default 15.0 THz), removing slots from the lowest-priority band first.

Fitting the DRA Parameters

The DRA model stores a per-channel parameter array of shape (6, num_channels, max_spans). The 6 rows have distinct origins and serve different purposes in the simulation:

Row	Symbol	Origin	Used by
0	`C_f`	LM fit to ODE profile	`gn_model_dra()` — NLI calculation
1	`a_f`	LM fit to ODE profile	`gn_model_dra()` — NLI calculation
2	`C_b`	LM fit to ODE profile	`gn_model_dra()` — NLI calculation
3	`a_b`	LM fit to ODE profile	`gn_model_dra()` — NLI calculation
4	`a`	LM fit to ODE profile	`gn_model_dra()` — NLI calculation
5	`G_Raman`	Direct from ODE endpoint	`get_snr_dra()` — ASE noise / Friis cascade

Rows 0-4 and row 5 are both derived from the same ODE solution but serve different roles:

Rows 0-4 parameterise the semi-analytical power profile shape along the entire span, which enters the NLI integral via the Raman tilt factors (Tf, Tb, T) in the 9-mode eta functions of gn_model_dra().
Row 5 is the per-channel Raman gain at the span endpoint, used only for the ASE noise figure calculation in get_snr_dra().

Step 1: Solve the Raman ODE

The coupled signal+pump propagation ODE is solved along the span using jax.experimental.ode.odeint (Dormand-Prince adaptive method). The state vector y contains all signal channels and backward pump lasers:

dy/dt = (g_R @ y - att_vec) * y

where g_R is the Raman coupling matrix (triangular approximation, 15 THz cutoff) and att_vec is +attenuation for forward-propagating signals and -attenuation for counter-propagating backward pumps. The ODE is evaluated at 501 equally-spaced z-points along [0, L] (201 in the differentiable variant fit_dra_params_jax).

Signal-signal ISRS is excluded from the ODE by zeroing g_R[:num_channels, :num_channels], because the GN model handles signal-signal ISRS separately via its perturbative tilt formula. Including it would double-count the effect.

Backward pump boundary condition: The backward pump propagates from z=L to z=0, but the ODE integrates forward. The initial BW pump power at z=0 is found via scipy.optimize.minimize (TNC method) so that the pump power at z=L matches the configured --raman_pump_power_bw values.

Step 2: Extract per-channel ODE Raman gain (row 5)

From the ODE power profile P(z), the per-channel Raman gain is computed directly as:

G_Raman(i) = P_i(z=L) / [P_i(z=0) * exp(-alpha * L)]

This is the ratio of actual signal power at the span output to what it would be with pure fibre attenuation (no pumps). It captures pump depletion, multi-pump interference, and all ODE-resolved effects. No fitting or approximation is applied — this is a direct readout from the numerical ODE solution.

Step 3: Fit the semi-analytical profile (rows 0-4)

The normalised ODE profile rho_norm(z) = P(z) / P(0) is fitted per-channel to the semi-analytical formula from Semrau et al.:

rho(z) = exp(-a * z) * (1 - delta_f * (C_f * P_f * L_eff_f(z) + C_b * L_eff_b(z)))

where:

L_eff_f(z) = (1 - exp(-a_f * z)) / a_f — forward pump effective length
L_eff_b(z) = (exp(-a_b * (L-z)) - exp(-a_b * L)) / a_b — backward pump effective length
delta_f = f_channel - f_hat — frequency offset from the mean pump frequency
P_f = total forward power (signals + forward pumps); uses P_b=1 convention where backward pump power is absorbed into C_b

The 5 parameters [C_f, a_f, C_b, a_b, a] are fitted via jaxopt.LevenbergMarquardt as multipliers of physically-motivated initial values [Cr, att, Cr, att, att]. The fit minimises the residual between rho_semi(z) and rho_norm(z) across all z-points along the span.

Important: The fitted attenuation a (row 4) generally differs from the physical fibre attenuation alpha. The LM fit adjusts a to best match the overall profile shape, which means it absorbs some of the Raman gain into the exponential decay term. For channels with strong Raman gain, a can be significantly smaller than alpha (or even negative). This is by design — the semi-analytical formula is optimised for the NLI integral where the profile shape along the whole span matters, not just the endpoint gain.

Relationship between ODE and semi-analytical results

Since both are derived from the same ODE solution:

Row 5 (ODE gain) is the ground truth for endpoint Raman gain — it is used wherever an accurate per-channel gain value is needed (ASE noise figure).
Rows 0-4 (semi-analytical fit) approximate the power profile shape — they are used in the NLI integral where the z-dependent behaviour matters. They are a good but imperfect approximation; the fit quality is optimised for the integral, not for the endpoint value.

The triangular Raman approximation used in the ODE produces smooth, monotonic gain profiles by construction. To capture the true peaked/structured Raman gain spectrum of silica fibre (which would show spectral ripple from multi-pump interactions), a measured Raman gain profile would be needed instead of the triangular model.

Why offline fitting is valid: The signal power profile along a Raman-amplified span depends only on the fibre properties (attenuation, Raman gain slope), span length, and pump configuration — not on which signal channels are currently active. The Raman pump power dominates the interaction. Therefore the fit parameters can be computed once at environment creation time and reused for all subsequent calculations during the simulation.

The entire fitting procedure runs once during make_env.py environment creation (outside JAX JIT compilation) and typically completes in a few seconds. The parameters are stored as a static array (uniform across spans) in EnvParams.raman_fit_params.

Pump depletion: With high Raman gain slopes or many signal channels, the backward pump can be substantially depleted within the first few kilometres of the span. For example, with C_r = 2.37e-16 and 91 channels, the pump e-folding distance is approximately 4 km. The net Raman gain after pump depletion may be modest (e.g. 0.5 dB for small pump powers), but the distributed nature of the gain still improves the signal power profile compared to EDFA-only amplification.

Differentiable variant (`fit_dra_params_jax`)

The --raman_fit_method=jax option uses fit_dra_params_jax(), a differentiable twin of the default fitter. The key differences:

Backward pump boundary: Uses jax.custom_vjp wrapping scipy TNC. The forward pass calls scipy; the backward pass uses the implicit function theorem via jax.jacobian (reverse-mode, since odeint provides custom_vjp not custom_jvp).
Gradient flow: Rows 0-4 are detached via jax.lax.stop_gradient. Row 5 (ODE Raman gain) carries full gradients through the ODE solution, enabling pump power optimisation.
LM fitting: Uses jaxopt.LevenbergMarquardt with implicit_diff=False. The normalisation ch_norm is detached to prevent gradient leakage through the profile shape.

Enabling DRA

Enable DRA with the --use_raman_amp flag and provide backward (and optionally forward) Raman pump parameters:

python -m xlron.train.train \
  --env_type=rsa_gn_model \
  --topology_name=nsfnet_deeprmsa_directed \
  --link_resources=100 --k=5 --load=250 \
  --continuous_operation --use_raman_amp \
  --raman_pump_power_bw=0.3,0.3 \
  --raman_pump_freq_bw=205e12,210e12 \
  --coherent \
  --TOTAL_TIMESTEPS=1000000 --NUM_ENVS=100 \
  --EVAL_HEURISTIC --path_heuristic=ksp_ff

Pump powers are specified in Watts and pump frequencies in Hz, as comma-separated lists. Each entry corresponds to one pump laser. The same pump configuration is applied uniformly to all spans on all links.

Bandwidth Limiting with `--slots_per_band`

The --slots_per_band flag allows explicit control over how many frequency slots are allocated per band, overriding the default behaviour of filling each band's full spectral width. This is particularly useful with DRA to ensure the total modulated bandwidth stays within the triangular Raman validity range.

# C-band with 43 slots + L-band with 47 slots at 100 GHz slot size
# Total: 43 + 47 = 90 data slots + 1 gap slot = 91 link_resources
python -m xlron.train.train \
  --env_type=rsa_gn_model \
  --slot_size=100 --guardband=0 \
  --band_preference=C,L --slots_per_band=43,47 \
  --inter_band_gap_ghz=500 \
  --link_resources=91 --use_raman_amp \
  ...

The number of entries in --slots_per_band must match the number of bands in --band_preference. Each value is capped at the maximum number of slots that physically fit in that band.

ASE Noise with DRA

When DRA is enabled, the ASE noise is computed using a Friis cascade model that treats the DRA and EDFA as two amplifier stages in series. This is implemented in get_snr_dra() in isrs_gn_model_dra.py and uses row 5 (ODE Raman gain) of the fit parameters.

DRA noise figure: The backward-pumped DRA is modelled as a lumped amplifier with noise figure:

NF_DRA = 1/G_Raman + 2 * n_sp * (1 - 1/G_Raman)

where G_Raman is the per-channel ODE Raman gain (row 5, clamped to >= 1) and n_sp ≈ 1.13 is the phonon population factor at room temperature (300 K) for silica's dominant Raman shift of ~13 THz.

Hybrid noise figure: The DRA and EDFA are combined via the Friis cascade formula:

NF_hybrid = NF_DRA + (NF_EDFA - 1) / G_Raman

where NF_EDFA is the EDFA noise figure (from --amplifier_noise_figure or per-band CSV data). The Raman pre-amplification reduces the EDFA's noise contribution by a factor of G_Raman.

ASE per span: The total inline ASE uses the standard formula with the hybrid noise figure:

P_ASE_span = NF_hybrid * G_total * h * f * B

where G_total is the ISRS-aware total span gain (compensating fibre loss + lumped connector loss) computed by calculate_amplifier_gain_isrs(). The total ASE is accumulated linearly across all spans: P_ASE = num_spans * P_ASE_span.

Behaviour at different Raman gains: At low Raman gain (G_Raman ~ 1), NF_hybrid approaches NF_EDFA (DRA has negligible effect). At high Raman gain (>10 dB), the (NF_EDFA - 1) / G_Raman term becomes small and NF_hybrid approaches NF_DRA ≈ 2*n_sp ≈ 2.26 (3.5 dB) — the quantum-limited DRA noise figure. This limits the net OSNR improvement from increasing pump power.

The per-channel Raman gain varies across the spectrum (typically higher for L-band than C-band with standard pump configurations), which introduces a frequency-dependent tilt in the OSNR_ASE profile. This tilt can be managed through pump power distribution and per-band amplifier noise figure settings in the transceiver/amplifier CSV data file.

DRA Configuration Parameters

Flag	Default	Units	Description
`--use_raman_amp`	False	--	Enable DRA model for NLI calculation
`--raman_pump_power_fw`	None	W	Forward pump powers (comma-separated)
`--raman_pump_power_bw`	None	W	Backward pump powers (comma-separated)
`--raman_pump_freq_fw`	None	Hz	Forward pump frequencies (comma-separated)
`--raman_pump_freq_bw`	None	Hz	Backward pump frequencies (comma-separated)
`--raman_fit_method`	`triangular`	--	Fitting method: `triangular` (scipy, default) or `jax` (differentiable)
`--raman_max_bandwidth_thz`	15.0	THz	Max modulated bandwidth for triangular Raman validity
`--slots_per_band`	None	--	Comma-separated slot count per band (e.g. `43,47`)

Configuration Parameters

Fibre Parameters

Flag	Default	Units	Description
`--attenuation`	4.605e-5	Np/m	Fibre attenuation (0.2 dB/km)
`--nonlinear_coefficient`	1.2e-3	1/(W.m)	Kerr nonlinear coefficient
`--dispersion_coeff`	17e-6	s/m^2	Group velocity dispersion D (17 ps/nm/km)
`--dispersion_slope`	60.7	s/m^3	Dispersion slope dD/dlambda (0.067 ps/nm^2/km)
`--raman_gain_slope`	2.8e-17	1/(W.m.Hz)	ISRS Raman gain slope
`--ref_lambda`	1564e-9	m	Reference wavelength (C+L band centre)
`--max_span_length`	100000	m	Maximum span length (100 km)
`--coherent`	False	--	Coherent NLI accumulation across spans
`--uniform_spans`	True	--	Assume equal-length spans per link

ROADM Parameters

Flag	Default	Units	Description
`--roadm_express_loss`	5.0	dB	Insertion loss of express (pass-through) ROADM
`--roadm_add_drop_loss`	8.0	dB	Insertion loss of add/drop ROADM
`--roadm_noise_figure`	5.0	dB	Noise figure of ROADM booster amplifiers

Power Parameters

Flag	Default	Units	Description
`--max_power_per_fibre`	21.0	dBm	Maximum total launch power per fibre link
`--power_per_channel`	None	dBm	Per-channel launch power. If None, defaults to `max_power_per_fibre / link_resources`
`--launch_power_type`	fixed	--	Power assignment strategy: `fixed`, `tabular`, `rl`, or `scaled`

SNR and Modulation Parameters

Flag	Default	Units	Description
`--snr_margin`	1	dB	Margin added to modulation format SNR thresholds
`--mod_format_correction`	False	--	Enable modulation-format-dependent NLI correction
`--modulations_csv_filepath`	(built-in)	--	Path to modulation formats CSV file
`--calc_minimum_osnr`	False	--	Calculate `minimum_osnr` from spectral efficiency using GSNR threshold formula (ignores CSV values)
`--beta_fec`	1.5e-2	--	Pre-FEC BER target for GSNR threshold calculation (used with `--calc_minimum_osnr`)
`--fec_rate`	0.8	--	FEC code rate applied to accepted bitrate in `rmsa_gn_model` (`effective_bitrate = requested * fec_rate`)
`--fec_threshold`	0.28	--	FEC overhead fraction (28%) for throughput calculation in `rsa_gn_model`
`--num_subchannels`	1	--	Nyquist subchannels per slot. Divides slot bandwidth by N for SPM; XPM and ASE unchanged. See Nyquist Subchannel Modelling.
`--max_snr`	50.0	dB	Upper SNR clamp for observations
`--min_snr`	7.0	dB	Lower SNR limit for throughput calculation

Spectral Parameters

Flag	Default	Units	Description
`--slot_size`	12.5	GHz	Spectral width of each frequency slot
`--link_resources`	--	--	Number of frequency slots per link
`--guardband`	1	slots	Guard band between adjacent channels
`--enforce_band_gaps`	True	--	Mark inter-band gap slots as unusable (from `band_data.csv`)
`--band_data_filepath`	None	--	Path to band definition CSV (defaults to built-in `band_data.csv`)
`--band_preference`	None	--	Comma-separated band fill order for first-fit/last-fit (e.g. `C,L,S`)
`--slots_per_band`	None	--	Comma-separated slot count per band (overrides auto-fill, e.g. `43,47`)

Summary: `rsa_gn_model` vs `rmsa_gn_model`

Aspect	`rsa_gn_model`	`rmsa_gn_model`
Modulation format	Fixed (implicit)	Per-channel, SNR-adaptive
Action masking	Standard RSA (free-slot check)	Full GN model evaluation per candidate
Masking speed	Fast	Slow (GN model per candidate)
SNR check on action	Yes (post-placement)	Yes (post-placement)
Power budget check	Yes	Yes
Throughput computation	Shannon-Hartley (optional)	Not available
Key metric	Throughput (Gbit/s)	Blocking probability, accepted bitrate
Best for	Capacity studies, fast evaluation	Realistic RL training, NLI-aware allocation
Modulation format correction	Not applicable	Optional (`--mod_format_correction`)

GN Model Physical Layer

Overview of the Model

Link Model: Fibre Spans and Inline Amplifiers

Fibre Attenuation

Inline Amplifier ASE Noise

Nonlinear Interference (NLI)

Modulation Format Correction

Transceiver Noise

ROADM Node Model

Power Budget Enforcement

Spectral Band Data

Band Preference for Heuristic Slot Allocation

Modulation Formats

Calculating SNR Thresholds from Spectral Efficiency

Path-Level SNR Computation

Nyquist Subchannel Modelling (num_subchannels)

Motivation

How It Works

Best Practice

Worked Example: Per-Slot State Arrays

Centre Frequency Caching

rsa_gn_model Environment

Intended Use

How It Works

Observation Space

When to Use

rmsa_gn_model Environment

Intended Use

How It Works

Performance Considerations

FEC Code Rate

Metrics

Distributed Raman Amplification (DRA)

Physics Model

Triangular Raman Approximation

Fitting the DRA Parameters

Step 1: Solve the Raman ODE

Step 2: Extract per-channel ODE Raman gain (row 5)

Step 3: Fit the semi-analytical profile (rows 0-4)

Relationship between ODE and semi-analytical results

Differentiable variant (fit_dra_params_jax)

Enabling DRA

Bandwidth Limiting with --slots_per_band

ASE Noise with DRA

DRA Configuration Parameters

Configuration Parameters

Fibre Parameters

ROADM Parameters

Power Parameters

SNR and Modulation Parameters

Spectral Parameters

Summary: rsa_gn_model vs rmsa_gn_model

See Also

Nyquist Subchannel Modelling (`num_subchannels`)

`rsa_gn_model` Environment

`rmsa_gn_model` Environment

Differentiable variant (`fit_dra_params_jax`)

Bandwidth Limiting with `--slots_per_band`

Summary: `rsa_gn_model` vs `rmsa_gn_model`