[Roadmap area] Solver improvements #3910

brosaplanella · 2024-03-20T13:29:35Z

I'd like to continue making PyBaMM easier to work with for parameter inference libraries like PyBOP (https://github.com/pybop-team/PyBOP). At the moment I think this includes:

- GPU as well as multithreaded support for running many simulations in parallel (Add support for MLIR-based expression evaluation #3826)
- Integrate sensitivities more into Simulation (generalise simulation, include sensititivies #3834)
- Run multiple simulations from different starting points (IDAKLU solver: add option for multiple initial conditions #3713). It should be possible to reset the starting points in the middle of a solve.
- This is generally "making it faster", so profiling would fit in here (see delay xarray.DataArray initialization #3862)
- others?....

Originally posted by @martinjrobins in #3839 (comment)

rtimms · 2024-06-07T15:18:40Z

Not solver, but this is related to speed-up: #4058

martinjrobins · 2024-07-15T18:17:01Z

MarcBerliner · 2024-07-31T15:02:10Z

Here are my thoughts on improving the solver. Please let me know if you have comments or questions @martinjrobins and others.

Solver options

Add more idaklu solver options (number 2) #4282

Initialization

Improve consistent initialization speed and robustness #4301

Time stepping

In PyBaMM, the solver currently stops at all values in the t_eval vector. It seems that we use t_eval and set some dt for a few distinct purposes:

1. To enforce time-dependent discontinuities within a single step, like with a drive cycle
2. To set the data collection frequency (as in the period experiment kwarg)
3. To force better solver convergence by setting a small dt (taking smaller time steps)

These three reasons for setting a dt are all valid, but stopping the simulation at all time steps can have a drastic impact on the adaptive time stepping and performance. For example, consider a full C/10 discharge with

a. t_eval = [0, 36000] (i.e., no stops)
b. t_eval = np.arange(0, 36060, 60) (pybamm default)

If we compare the solver stats,

Number of steps: a. 165 vs b. 715
Number of residual calls: a. 288 vs b. 823
Number of linear solver setups: a. 28 vs b. 91
Number of nonlinear iterations: a. 286 vs b. 821
DAE integration time: a. 25.5 ms vs b. 97.1 ms
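As a quick check on the numbers above, the ratios between the two runs can be computed directly. The snippet only restates the quoted statistics; the dictionary layout and the `ratios` helper are illustrative names and not part of PyBaMM.

```python
# Solver statistics quoted above.
# First value: (a) t_eval = [0, 36000]; second value: (b) 60 s spacing.
stats = {
    "steps": (165, 715),
    "residual calls": (288, 823),
    "linear solver setups": (28, 91),
    "nonlinear iterations": (286, 821),
    "DAE integration time [ms]": (25.5, 97.1),
}


def ratios(table):
    """Return the ratio b/a for every statistic in the table."""
    return {name: b / a for name, (a, b) in table.items()}


# Number of points in the dense grid, equivalent to np.arange(0, 36060, 60).
n_points_dense = len(range(0, 36060, 60))

if __name__ == "__main__":
    print(f"points in dense t_eval: {n_points_dense}")
    for name, value in ratios(stats).items():
        print(f"{name}: b/a = {value:.2f}")
```

The dense grid has 601 points, so the solver is forced to land on 600 times after the start that the adaptive algorithm would not otherwise have chosen.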

Even though we solve the same system, the dense t_eval b. is nearly 4x slower! To address these issues, I propose the following changes that align the time-stepping options with Julia's differential equation solvers (see Output Control):

(Breaking) By default, save every t and y determined by IDA's adaptive time stepping algorithm. This eliminates issues like the one above with the C/10 discharge. We can accurately post-interpolate the solution onto specific times with IDA's Hermite interpolator. This is a huge benefit for parameter estimation because we will always obtain the full solution accuracy regardless of the data collection frequency.
(Non-breaking) Change the description of t_eval to match Julia's tstops: "Denotes extra times that the time-stepping algorithm must step to. This should be used to help the solver deal with discontinuities and singularities, since stepping exactly at the time of the discontinuity will improve accuracy." With this option, the drive cycles in 1. still work.
(Non-breaking) Add a solver option that matches Julia's saveat: "Denotes specific times to save the solution at, during the solving phase. The solver will save at each of the timepoints in this array in the most efficient manner available to the solver." This addresses the data collection frequency in 2. without negatively affecting performance since it interpolates the solution with IDAGetDky().
(Non-breaking) Discourage modifying t_eval for performance issues and encourage modifying appropriate solver options (rtol, atol, dt_max, etc.), which addresses 3.
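To make the distinction between stop times and save times concrete, here is a minimal sketch on a toy problem. It is not IDA and not PyBaMM: it assumes the scalar test equation y' = -y, uses a simple Heun/Euler embedded pair with step-size control in place of IDA's variable-order BDF method, and uses a cubic Hermite interpolant in place of `IDAGetDky()`. The names `solve_adaptive`, `hermite`, `demo`, and the local `tstops` and `saveat` variables are illustrative only.

```python
import bisect
import math


def rhs(t, y):
    """Toy scalar ODE y' = -y, standing in for the battery model."""
    return -y


def solve_adaptive(t0, t1, y0, tol=1e-6, tstops=()):
    """Adaptive Heun/Euler pair. Stores every accepted step.

    Steps are shortened only when needed to land exactly on a stop time.
    """
    stops = sorted(s for s in tstops if t0 < s < t1) + [t1]
    ts, ys, fs = [t0], [y0], [rhs(t0, y0)]
    h = 1e-3 * (t1 - t0)  # initial guess; the controller corrects it
    k = 0
    while k < len(stops):
        t, y, f = ts[-1], ys[-1], fs[-1]
        clipped = t + h >= stops[k]
        h_try = stops[k] - t if clipped else h
        y_euler = y + h_try * f
        y_heun = y + 0.5 * h_try * (f + rhs(t + h_try, y_euler))
        err = abs(y_heun - y_euler)  # local error estimate
        if err == 0.0:
            factor = 2.0
        else:
            factor = min(2.0, max(0.2, 0.9 * math.sqrt(tol / err)))
        if err > tol:
            h = h_try * factor  # reject and retry with a smaller step
            continue
        t_new = stops[k] if clipped else t + h_try
        ts.append(t_new)
        ys.append(y_heun)
        fs.append(rhs(t_new, y_heun))
        # A clipped step does not shrink the controller's own proposal.
        h = max(h, h_try * factor) if clipped else h_try * factor
        if clipped:
            k += 1
    return ts, ys, fs


def hermite(ts, ys, fs, t):
    """Cubic Hermite interpolation of the stored steps at time t."""
    i = min(max(bisect.bisect_right(ts, t) - 1, 0), len(ts) - 2)
    h = ts[i + 1] - ts[i]
    s = (t - ts[i]) / h
    h00 = (1 + 2 * s) * (1 - s) ** 2
    h10 = s * (1 - s) ** 2
    h01 = s * s * (3 - 2 * s)
    h11 = s * s * (s - 1)
    return h00 * ys[i] + h10 * h * fs[i] + h01 * ys[i + 1] + h11 * h * fs[i + 1]


def demo():
    """Compare a free run with a run that stops at every output time."""
    free = solve_adaptive(0.0, 10.0, 1.0)
    tstops = [0.01 * i for i in range(1, 1000)]
    stopped = solve_adaptive(0.0, 10.0, 1.0, tstops=tstops)
    saveat = [0.5 * i for i in range(21)]
    saved = [hermite(*free, t) for t in saveat]  # post-interpolation
    max_err = max(abs(v - math.exp(-t)) for t, v in zip(saveat, saved))
    return len(free[0]) - 1, len(stopped[0]) - 1, max_err


if __name__ == "__main__":
    n_free, n_stopped, max_err = demo()
    print(f"steps without stops: {n_free}")
    print(f"steps with stops:    {n_stopped}")
    print(f"max interpolation error at saveat times: {max_err:.2e}")
```

In this toy, forcing a stop at every output time increases the number of accepted steps, while interpolating the unconstrained run onto the `saveat` times adds no steps at all. The size of the effect here says nothing quantitative about IDA; it only illustrates the mechanism.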

martinjrobins · 2024-07-31T18:55:39Z

yea, agree that it would be great to get rid of IDASetStopTime! or at least reduce the number of times we use it. Would this play nicely with the casadi solver?

MarcBerliner · 2024-07-31T21:00:29Z

Would this play nicely with the casadi solver?

If we can access IDA's internal time steps via Casadi, I think we can do most of this stuff

agriyakhetarpal · 2024-08-06T14:35:41Z

Corollary from the PyBaMM developer meeting on 05/08/2024 as a part of PyBaMM running in WASM:

CasADi can be compiled with the Emscripten toolchain starting with the upcoming v3.6.6 and will be available in Pyodide with the next patch or minor version (either v0.26.3 or v0.27.0).
- uses NumPy for computations, so no floating point imprecisions were noticed (so far)
Compiling IDAS doesn't take too much effort; linking with KLU could be difficult. I'm unsure if IREE is available/possible. Reference BLAS/LAPACK implementations should be available and configurable.
- the IDAKLU solver should be possible to compile in theory, but threading/OpenMP will need to be disabled via I4087-multiprocessing #4260 or similar
- the current sdist is broken, can be fixed via further changes in scikit-build-core CI Builds #4242
Best way to proceed with this is to compile PyBaMM without the IDAKLU solver for now (i.e., as a pure Python package) and provide pybamm.CasadiSolver() through a Pyodide instance, which can then be used in the docs for the example notebooks or other in-browser uses (more or less, JupyterLite)
- ship a pure Python wheel (with BUILD_IDAKLU set to OFF) in addition to platform-specific wheels, since pip always chooses the most specific wheel available

martinjrobins · 2024-08-06T14:55:25Z

@agriyakhetarpal: FYI, I've compiled IDA with KLU using Emscripten in the past, see this repo: https://github.com/martinjrobins/diffeq-runtime. However, I agree that focusing on the casadi solver for now is the best approach, since you have it already compiled and in Pyodide.

agriyakhetarpal · 2024-08-06T15:16:22Z

Thanks for the resource, @martinjrobins! I see you've built a static lib for KLU and also used NO_LAPACK – should work well. I'll tinker with the settings and maybe I can reduce the binary size a bit :)

MarcBerliner · 2024-08-27T14:58:53Z

To improve the speed of our ODE models, I'd like to add CVODE from SUNDIALS to our suite of C solvers in addition to IDA. I have a few questions about the implementation, and I'd like to hear your thoughts @jsbrittain, @martinjrobins, @pipliggins, and others.

In C++, we can make a base solver class and derived classes for IDA and CVODE. The solver code structure for both will be very similar, so this will help with code reuse. However, this will cause a headache for some of the existing PRs.
The CasadiSolver automatically determines if the system is an ODE or a DAE and passes it to the appropriate solver. I think this approach also makes sense for our C solvers. If we do take this approach, then...
One minor issue is that the IDAKLUSolver name will no longer be accurate. Since we plan on streamlining the IDAKLU installation process and changing the default away from CasadiSolver, maybe we can also rename IDAKLUSolver to SundialsSolver or something. This might make the "new and improved default solver" announcement splashier (cc @valentinsulzer @rtimms).
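The structure described above (a shared base class plus automatic ODE/DAE dispatch) could look roughly like the following. This is a Python sketch of the idea only, whereas the proposal itself is for the C++ layer. Every name in it (`Model`, `BaseSundialsSolver`, `IDASolver`, `CVODESolver`, `select_solver`) is hypothetical and does not correspond to existing PyBaMM classes.

```python
from dataclasses import dataclass


@dataclass
class Model:
    """Hypothetical stand-in for a discretised model."""

    n_differential: int
    n_algebraic: int = 0

    @property
    def is_ode(self) -> bool:
        # No algebraic equations means the system is a pure ODE.
        return self.n_algebraic == 0


class BaseSundialsSolver:
    """Options and bookkeeping shared by both backends (hypothetical)."""

    backend = "none"

    def __init__(self, rtol: float = 1e-6, atol: float = 1e-6):
        self.rtol = rtol
        self.atol = atol

    def describe(self, model: Model) -> str:
        size = model.n_differential + model.n_algebraic
        return f"{self.backend}: {size} states, rtol={self.rtol}, atol={self.atol}"


class IDASolver(BaseSundialsSolver):
    """DAE backend; it can also be given an ODE."""

    backend = "IDA"


class CVODESolver(BaseSundialsSolver):
    """ODE-only backend."""

    backend = "CVODE"

    def describe(self, model: Model) -> str:
        if not model.is_ode:
            raise ValueError("CVODE handles ODEs only; use IDA for DAEs")
        return super().describe(model)


def select_solver(model: Model, **options) -> BaseSundialsSolver:
    """Pick the backend from the model structure."""
    cls = CVODESolver if model.is_ode else IDASolver
    return cls(**options)


if __name__ == "__main__":
    print(select_solver(Model(n_differential=20)).describe(Model(20)))
    dae = Model(n_differential=20, n_algebraic=5)
    print(select_solver(dae, rtol=1e-8).describe(dae))
```

With this layout the user-facing entry point does not name a specific SUNDIALS package, which is the reason the IDAKLUSolver name would stop being accurate.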

And just FYI, I don't plan on starting this work for at least a couple of weeks.

brosaplanella assigned martinjrobins Mar 20, 2024

martinjrobins mentioned this issue May 14, 2024

refactor multiprocessing and multiple inputs #4087

Open

martinjrobins mentioned this issue Jul 30, 2024

Improve consistent initialization speed and robustness #4301

Merged

6 tasks

martinjrobins mentioned this issue Aug 2, 2024

Time stepping in IDA - minimise solver restarts #4312

Closed

4 tasks

MarcBerliner mentioned this issue Sep 3, 2024

Add CVODE solver #4407

Open

martinjrobins commented Jul 15, 2024

Sensitivities

Multithreaded/GPU support:

Post Processing optimisation and refactoring

Solver refactoring

Solver documentation
