after just a few minutes after I created my new space,my space was paused and 503 when restart.
ID:Root=1-693c2f0a-648372ed194a28d5316eb464
after just a few minutes after I created my new space,my space was paused and 503 when restart.
ID:Root=1-693c2f0a-648372ed194a28d5316eb464
Same issue for our BenCzechMark space. Any solution?
That information alone isn’t enough to pinpoint the issue, but I recalled that Spaces’ default settings have been acting strangely lately, so I’m passing this along. For some reason, the Python version is set to 3.13, which differs from the documentation…
Adding python_version: “3.10” to README.md should create build conditions similar to those in 2025:
---
title: 🇨🇿 BenCzechMark
emoji: 📊
colorFrom: gray
colorTo: blue
sdk: gradio
sdk_version: 4.42.0
python_version: "3.10"
app_file: app.py
Thank you for looking into it. I tried adding an explicit Python interpreter version. Restart/Factory Rebuild still produces
503
Something went wrong when restarting this Space.
Request ID: Root=1-69831bb9-4871e7833794d02c2cb7a365
Perhaps you could determine a specific issue from the log corresponding to the request id? Is there any more information I could provide?
Only HF staff can tell anything from the request ID…
Your space’s code is quite resilient to server-side changes.
Even if I look for flaws, the only possibilities I can find are things like bokeh not being pinned, the contents of environment variables, or errors related to other datasets or servers it references…
Anyway, if it doesn’t boot for some reason, I think it’ll result in a 503. If that’s logged in your container logs, I think we can pinpoint the cause.
This is a generic front-end error that usually happens when Hugging Face cannot successfully complete one of these phases:
The only reliable way to distinguish (1)/(2)/(3) is the Space’s Open Logs → Build logs / Container logs. HF’s docs explicitly point to these two log streams for debugging. (Hugging Face)
Your code reads several env vars with os.environ["..."]. If any are missing, Python raises KeyError and the container will crash immediately → restart yields 503.
In app.py:
HF_SPACE_TOKEN = os.environ["HF_SPACE_TOKEN"]HF_SPACE_ID = os.environ["HF_SPACE_ID"] (Hugging Face)In server.py:
HF_TOKEN = os.environ["HF_TOKEN"]HF_RESULTS_DATASET = os.environ["HF_RESULTS_DATASET"] (Hugging Face)Why this matters: even if the Space used to work, a rebuild/restart can expose missing secrets (or a renamed variable) instantly.
Workaround (code-hardening):
os.environ["X"] with os.environ.get("X") and fail gracefully with a clear UI message (or disable affected features) instead of crashing the whole app.LeaderboardServer.__init__() runs on import in app.py and immediately calls results_dataset_integrity_check() and update_leaderboard(). (Hugging Face)
update_leaderboard() calls _update_models_and_tournament_results(), which calls snapshot_download(... token=HF_TOKEN ...) into local_dir="./". (Hugging Face)
If:
HF_RESULTS_DATASET is wrong (typo, renamed dataset),HF_TOKEN is invalid/expired,then startup can fail → Space can’t restart (503).
Workarounds:
snapshot_download in try/except and show an “upstream unavailable” message while letting the UI boot.HF_TOKEN has permission to read/write the dataset repo referenced by HF_RESULTS_DATASET.Your UI includes logic to restart the Space if an external endpoint is not reachable:
check_significance_is_reachable_hook() calls check_significance_is_reachable()api.restart_space(repo_id=HF_SPACE_ID, token=HF_SPACE_TOKEN) (Hugging Face)gr.Timer(...).tick(...)).Separately, server.py’s check_significance_is_reachable() hits a fixed URL on czechllm.fit.vutbr.cz. (Hugging Face)
If HF egress/DNS intermittently fails to that host (or that host is down), your app may repeatedly restart itself. This can look like “it never stays up” and can manifest as frequent 503s to users.
Workaround (strongly recommended):
requirements.txt)Your requirements.txt is mostly pinned — except the last line: bokeh is unpinned. (Hugging Face)
Your code uses Bokeh to generate HTML/iframes (e.g., bokeh.embed.file_html(...), CDN.render(), etc.). (Hugging Face)
If a recent commit caused a rebuild (even a README edit), pip will install the latest Bokeh at rebuild time. If Bokeh introduced a breaking change relative to what your code previously ran with, you can get a startup crash (or a crash during leaderboard generation) that did not exist before.
Workaround:
bokeh==<version-you-tested>).requirements.txt via pip-compile / constraints to avoid “one unpinned package breaks everything”.Your README metadata looks structurally correct for a Gradio Space (sdk: gradio, app_file: app.py, python_version: '3.10', etc.). (Hugging Face)
So a simple YAML typo is less likely here.
Policy restrictions are possible in general, but nothing in this repo obviously screams “blocked category”. In practice, restrictions usually come with an explicit moderation banner or a message from HF; if you didn’t get that, treat it as lower probability.
Not from the outside.
That Request ID: Root=... is an internal tracing identifier. It can help Hugging Face staff locate the failing backend operation, but it’s not enough for external debugging without access to HF’s logs.
What you can provide that will usually pinpoint it in minutes:
HF’s docs and forum guidance consistently point to “Open Logs → Build / Container” as the primary debugging path. (Hugging Face)
Open Logs → Build
bokeh).Open Logs → Container
KeyError: 'HF_TOKEN' (or similar): env var missing. (Your code will crash on missing required vars.) (Hugging Face)snapshot_download: token/permission/dataset name issue. (Hugging Face)czechllm.fit.vutbr.cz reachability: remove the self-restart logic. (Hugging Face)Quick workaround to get something running
If logs show nothing useful and restart still fails
huggingface_hub (restart/pause/sleep time, etc.). (Hugging Face)HF_TOKEN, HF_RESULTS_DATASET, HF_SPACE_TOKEN, HF_SPACE_ID). (Hugging Face)HF_TOKEN no longer has correct permissions for HF_RESULTS_DATASET, causing snapshot_download to fail at startup. (Hugging Face)bokeh is unpinned. (Hugging Face)Unfortunately, I do not think the issue lies in repo directly. We had a parallel repo turned on, with the exactly same code: 🇨🇿 BenCzechMark - a Hugging Face Space by CZLC - and it was working fine (except for the python version issue you pointed out, which we fixed in both repos now). The problem is that current repo even has empty log.
Due to inactivity from huggingface support, we decided to simple create a new space 🇨🇿 BenCzechMark - a Hugging Face Space by CZLC with a same functionality. Unfortunately, we lost likes and traffic info.
Just in case, we also kept old space alive (though still not functional
). Available here":
Hi, Thanks for reporting. You’ll need to update the version of next.js to latest or a fixed version per https://nextjs.org/blog/CVE-2025-66478. Let us know if you run into further issues - happy to help!
Thanks, michellehbn!