Skip to content

Conversation

yfnaji
Copy link
Contributor

@yfnaji yfnaji commented Aug 17, 2025

Natural Cubic Spline Interpolation

This Pull Request implements the cubic spline interpolation as one of the polynomial interpolators mentioned in issue #5.

The implementation was benchmarked against SciPy’s CubicSpline.

Mathematical Background

We want to construct a cubic spline $S(x)$ that interpolates the points

$$\left(x_0, y_0\right), \left(x_1, y_1\right), \ldots,\left(x_n, y_n\right)$$

Since we are working with a natural cubic spline, we define

$$ z_i = \frac{d^2S(x_i)}{dx^2}, $$

the second derivative of the spline at each interpolation point and set

$$ z_0 = z_n = 0 $$

The full derivation of the cubic spline formulation is fairly involved, so some details are omitted here. For reference, the derivation can be found in the lecture notes by T. Gambill (2011) Interpolation/Splines (slides 14-27), which served as the main source for this implementation.

The equation of a natural cubic spline on the interval $x\in\left[x_i, x_{i+1}\right)$ is given by:

$$ S_i(x) = \frac{z_i}{6h_i}\left(x_{i+1} - x\right)^3 +\frac{z_{i+1}}{6h_i}\left(x - x_i\right)^3 +\left(\frac{y_{i+1}}{h_i} - \frac{h_i}{6}z_{i+1}\right)\left(x - x_i\right) +\left(\frac{y_{i}}{h_i} - \frac{h_i}{6}z_{i}\right)\left(x_{i+1} - x\right) $$

(Equation (1))

The next step is to determine suitable values for $z_i$ in order to construct the natural cubic spline.

Differentiating $S_i(x)$ yields

$$ S_i'\left(x\right) = -\frac{z_i}{2h_i}\left(x_{i+1}-x\right)^2 +\frac{z_{i+1}}{2h_i}\left(x-x_i\right)^2 +\frac{y_{i+1}}{h_i} -\frac{h_i}{6}z_{i+1} -\frac{y_i}{h_i} +\frac{h_i}{6}z_i $$

At the interpolation point $x_i$, the spline segments must join smoothly. This requires the gradients at the interpolation point to be equal on both sides. Hence, the following condition is enforced:

dspline_eq

Substituting these expressions and rearranging gives the relation

$$ h_{i-1}z_{i-1} + 2\left(h_i + h_{i-1}\right)z_i + h_iz_{i+1} = 6\left(b_i - b_{i-1}\right) $$

where

$$ b_i=\frac{y_{i+1} - y_i}{h_i} $$

This leads to an $(n-1)\times(n-1)$ system of equations:

tri_diag_sys

where

$$ u_i = 2\left(h_i + h_{i+1}\right) $$

Now, we can solve the above equation to obtain the values of $z_i$.

All diagonal and off-diagonal entries of $A$ are positive, implying that $A$ is positive definite. This property guarantees the existence of a Cholesky decomposition, allowing us to factor $A$ as:

$$ A=LL^T \Rightarrow LL^T \mathbf{z} = \mathbf{b} $$

We can solve for $\mathbf{z}$ in two steps. First, define $\mathbf{x} := L^T \mathbf{z}$, and then solve for $\mathbf{x}$ using forward substitution:

$$L \mathbf{x} = \mathbf{b}$$

Once $\mathbf{x}$ is obtained, solve for $\mathbf{z}$ using backward substitution:

$$L^T \mathbf{z} = \mathbf{x}$$

Amendments to the InterpolationIndex trait

Additional traits have been added to the type Delta:

  • Add: Delta + Delta = Delta
  • Sub: Delta - Delta = Delta
  • IntoValue: Converts a Delta into an InterpolationValue. This is required for cubic splines, as the implementation involves arithmetic operations between Delta and InterpolationValue types
  • Copy: Required when unpacking Delta values from vectors

Amendments to the InterpolationValue trait

  • MulAssign: value_1 *= value_2
  • num::FromPrimitive: In order to take an f64 value and convert it to the appropriate InterpolationValue value

Removing Unsigned integers from InterpolationIndex implementation

u8, u16, u32, u64, u128, and usize have been removed from the trait implementation of InterpolationIndex because cubic splines can produce negative Delta values. The trait implementation for i8, i16, i32, i64, i128, and isize should be sufficient for cases where integers are used as indices.

@yfnaji yfnaji mentioned this pull request Aug 18, 2025
@avhz
Copy link
Owner

avhz commented Aug 30, 2025

Hi @yfnaji :) Thanks for this one, splines have been needed for a while!

I'll take a closer look tomorrow afternoon, and hopefully merge it

@yfnaji
Copy link
Contributor Author

yfnaji commented Aug 31, 2025

Hey @avhz - just saw the Clippy warnings, I am working on them now, but still feel free to review and comment on this PR (:

@avhz
Copy link
Owner

avhz commented Aug 31, 2025

@yfnaji
I did a bit of refactoring of the Curves today, which required adding Send + Sync to the interpolators. Doesn't look like it created any conflicts, so just a heads up :)

Copy link
Owner

@avhz avhz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A couple points

($b:ty, $c:ty) => {
impl IntoValue<$c> for $b {
fn into_value(self) -> $c {
self.as_seconds_f64()
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm a bit unsure about fixing a unit like this, why seconds?

Copy link
Contributor Author

@yfnaji yfnaji Sep 2, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

At some point we need to apply some arithmetic operations to the Delta type and so a complication arises when apply arithmetic operations between an f64 (the ValueType for time::Date ) and time::Duration e.g. in lower_tri_transpose_inv_times_diag_inv() (line 207):

ValueType::one() / diagonal[j].into_value()

where diagonal[j] is a Delta type.

As for converting time::Duration (to f64), the only possible method that provides this is as_seconds_f64().

We could perhaps create our own conversion?

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If I understand correctly, we can't do this because then we are setting the units in seconds which would make no sense on a yield curve, for example.

e.g. for a 1 day delta on a curve, the value would be 86,400.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I created an ad-hoc commit that removes all references to dates in the interpolation methods. This also required changes to curve.rs, since the Curve struct previously relied on date types. I have updated these to f64, representing year fractions, similar to the approach in QuantLib.

I also commented out the from_nodes() method for now, since it requires BTreeMap<OrderedFloat<f64>, f64> to implement PyFunctionArgument, which seems non-trivial without replacing it with PyDict or another Python object. I may need some help with this part.

All date-removal changes have been grouped into a single commit. If you do not like it, we can revert back to commit b72b0e0

@yfnaji yfnaji changed the title Cubic Spline Interpolation Natural Cubic Spline Interpolation Sep 2, 2025
yfnaji added 17 commits October 5, 2025 17:52
@yfnaji
Copy link
Contributor Author

yfnaji commented Oct 7, 2025

Hi @avhz

This PR has been amended to use Cholesky decomposition and elimination methods instead of LDLT, which has yielded more accurate results (more consistent with SciPy). This, however, introduces an additional trait bound for square roots, needed for the Cholesky decomposition.

I also removed date references from the Interpolators, since implementing cubic splines requires interaction between Delta types and Values; some meaningul conversion of time durations necessary. I have needed to amend curve.rs to reflect these changes (see comment).

This change may also allow us to simplify InterpolationIndex and InterpolationValue. However, doing so increases the complexity of the trait bounds when accounting for interactions between the two. Their distinction was useful when dates were present, since Delta had its own type, but that separation may now be obsolete if we continue without dates.

Note that I also commented out the from_nodes() method in curve.rs for now, since implementing PyFunctionArgument for BTreeMap<OrderedFloat<f64>, f64> is non-trivial without replacing it with a PyDict or another object. I may need some help with this part.

I’d appreciate feedback on whether these changes should remain grouped together in this PR, or if it would be clearer to split them into separate PRs. In particular, I’m unsure whether removing dates (and the possible obsolescence of InterpolationIndex vs. InterpolationValue) should be part of this PR or discussed separately.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants