[MNT] Diagnose and address long test runtimes (#1633) by Abhishek9639 · Pull Request #1692 · openml/openml-python

Abhishek9639 · 2026-02-26T17:41:07Z

[MNT] diagnose and address long test runtimes. Closes #1633

Changes

Current CI test runs take 1–2+ hours. This PR diagnoses the bottleneck and implements several improvements:

Root Cause

The production_server tests (74 tests) make live API calls to openml.org, taking ~1h 23m in CI even with 4-worker parallelization.
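
As a rough back-of-the-envelope check on these figures, the snippet below converts the reported wall time into an average per-test cost. It assumes the four workers were perfectly load-balanced and that the ~1h 23m is spent entirely on these 74 tests. Nothing in this thread confirms either assumption, so treat the result as an order-of-magnitude estimate only.

```python
# Rough estimate only: assumes perfect load balancing across workers and
# that the reported wall time is spent entirely on production_server tests.
WALL_TIME_MIN = 83   # ~1h 23m, from the PR description
WORKERS = 4          # parallel workers used in CI
TEST_COUNT = 74      # production_server tests


def average_seconds_per_test(wall_min: float, workers: int, tests: int) -> float:
    """Return total worker-seconds divided by the number of tests."""
    return wall_min * 60 * workers / tests


avg = average_seconds_per_test(WALL_TIME_MIN, WORKERS, TEST_COUNT)
print(f"~{avg:.0f} s per test ({avg / 60:.1f} min)")
```

Under those assumptions the average works out to roughly 4.5 minutes of worker time per test.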

Improvements

- Global per-test timeout (`pyproject.toml`)
  - Added `timeout = 600` (10 min) to `[tool.pytest.ini_options]`
  - Prevents any single test from hanging indefinitely
- CI workflow improvements (`.github/workflows/test.yml`)
  - Changed `--durations=20` → `--durations=0` to report ALL test durations for diagnosis
  - Added explicit `--timeout=600` to all 3 pytest invocations
- Fixture optimization (`tests/conftest.py`)
  - Changed `verify_cache_state` fixture scope from function → module
  - Reduces redundant filesystem I/O (was running before/after EVERY test)
- Benchmark script (`scripts/profile_tests.sh`)
  - New script for easy local test duration profiling
  - Configurable marker filters
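
Once pytest reports its slowest-durations section, the per-test lines can be aggregated to see which test files dominate the runtime. The following is a minimal sketch of one way to do that, not part of this PR. The function name and the sample report are hypothetical; the line format (`<seconds>s <phase> <node id>`) follows pytest's durations report.

```python
import re
from collections import defaultdict

# Matches lines such as "12.34s call     tests/test_x.py::test_name"
# from pytest's "slowest durations" report.
DURATION_LINE = re.compile(r"^\s*(\d+(?:\.\d+)?)s\s+(setup|call|teardown)\s+(\S+)")


def total_seconds_per_file(report: str) -> dict[str, float]:
    """Sum setup, call, and teardown durations for each test file."""
    totals: dict[str, float] = defaultdict(float)
    for line in report.splitlines():
        match = DURATION_LINE.match(line)
        if match:
            seconds, _phase, node_id = match.groups()
            # The file path is the part of the node id before the first "::".
            totals[node_id.split("::", 1)[0]] += float(seconds)
    return dict(totals)


# Hypothetical sample report; the file and test names are made up.
SAMPLE = """\
============ slowest durations ============
12.50s call     tests/test_a.py::test_download
3.25s setup    tests/test_a.py::test_download
0.75s call     tests/test_b.py::test_parse
"""

totals = total_seconds_per_file(SAMPLE)
for path, seconds in sorted(totals.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{seconds:8.2f}s  {path}")
```

In practice the report text would come from a saved pytest log rather than an inline string.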

Test Distribution Analysis

| Category | Count | CI Time |
|---|---|---|
| All tests | 368 | |
| `production_server` | 74 | ~1h 23m (bottleneck) |
| `test_server` | 196 | excluded from CI |
| `sklearn`-only | 6 | ~1 min |
| Non-server | 99 | fast |

Verification

All pre-commit checks pass (ruff, ruff-format, mypy)
All 368 tests still collect correctly

Abhishek9639 · 2026-02-26T17:49:21Z

Hi @geetu040 and @fkiraly,
I fixed the code quality checks. All pre-commit checks are now passing.
Please review.

geetu040

Thanks, the scripts/profile_tests.sh file will be used, but I am not sure about the other duration- and timeout-related changes. See comments below.

.github/workflows/test.yml

scripts/profile_tests.sh

tests/conftest.py

geetu040 · 2026-03-01T16:22:58Z

You should mention issue #1633 without a closing keyword such as `Fixes #1633`, since this PR doesn't close the issue; it only adds a script to help debug it.

Abhishek9639 · 2026-03-01T16:23:30Z

@geetu040,
I’ve made all the changes you suggested. Could you please review it once?
And if any further changes are needed, please let me know.
Thanks

geetu040

see the comment below

scripts/profile_tests.sh

Abhishek9639 · 2026-03-01T16:45:35Z

@geetu040,
Updated. I added -n for workers with --dist=load and removed -q for full output, so the script now mimics the exact CI pytest command. Please review.
If any further changes are needed, please let me know.
Thanks

geetu040

LGTM

@fkiraly please merge.

- Add global per-test timeout (600s) to pytest config
- CI: report all test durations (--durations=0) for diagnosis
- CI: add explicit --timeout=600 to prevent hanging tests
- Optimize verify_cache_state fixture: scope function -> module
- Add scripts/profile_tests.sh for local duration profiling

…script
- Revert CI workflow to original --durations=20 (no timeout)
- Remove global timeout from pyproject.toml
- Revert conftest.py verify_cache_state scope to function
- Update profile_tests.sh: accept CLI args (-m, -d, -t, -o) with defaults

- Add -n flag for parallel workers (default: 4)
- Add --dist=load to distribute tests across workers
- Remove -q flag for full pytest output
- Mimics exact pytest command used in CI
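
The options listed in these commit messages suggest a pytest command along the lines below. This is a Python sketch, not the actual `scripts/profile_tests.sh`, whose contents are not shown in this thread. The mapping of `-m`, `-d`, `-t`, and `-n` to marker, durations, timeout, and workers is an assumption, as are all default values except the 4 workers. Note that `--timeout` requires the pytest-timeout plugin, and `-n` and `--dist` require pytest-xdist.

```python
import shlex


def build_pytest_command(
    marker: str = "production_server",  # assumed default marker filter
    durations: int = 20,                # assumed default, mirrors CI's --durations=20
    timeout: int = 600,                 # assumed default, in seconds
    workers: int = 4,                   # default stated in the commit message
) -> list[str]:
    """Assemble a pytest invocation with profiling-related options."""
    return [
        "pytest",
        "-m", marker,
        f"--durations={durations}",
        f"--timeout={timeout}",
        "-n", str(workers),
        "--dist=load",
    ]


command = build_pytest_command()
print(shlex.join(command))
```

The resulting list can be passed directly to `subprocess.run(command)` to launch the profiling run.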

PGijsbers · 2026-04-07T13:17:49Z

Is this ready for review or not? :) I was pinged for this, but I see there are still commits being pushed.

codecov-commenter · 2026-04-07T13:27:42Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 54.67%. Comparing base (e653ef6) to head (144cee9).

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #1692   +/-   ##
=======================================
  Coverage   54.67%   54.67%           
=======================================
  Files          63       63           
  Lines        5108     5108           
=======================================
  Hits         2793     2793           
  Misses       2315     2315


Abhishek9639 · 2026-04-08T08:10:16Z

Hi @PGijsbers,
Yes, it's ready for review now! Sorry about the confusion; the force-push was just to clean up the commit history.
No more changes planned.
Thanks

geetu040 suggested changes Mar 1, 2026

.github/workflows/test.yml (outdated, resolved)

scripts/profile_tests.sh (resolved)

tests/conftest.py (outdated, resolved)

geetu040 suggested changes Mar 1, 2026

scripts/profile_tests.sh (resolved)

geetu040 approved these changes Mar 1, 2026

Abhishek9639 closed this Mar 7, 2026

Abhishek9639 reopened this Mar 7, 2026

Abhishek added 3 commits April 7, 2026 17:15

Update profile_tests.sh: add -n workers, --dist=load, remove -q

144cee9


Abhishek9639 force-pushed the mnt/diagnose-long-test-runtimes branch from 7114df4 to 144cee9 on April 7, 2026 11:46

Abhishek9639 wants to merge 3 commits into openml:main from Abhishek9639:mnt/diagnose-long-test-runtimes
