LuxOS Log Entry for Overtemperature Issues

Problem Description

A miner running LuxOS has stopped hashing, possibly because of excessive temperatures. This guide shows which log entries to look for when troubleshooting.

Solution

If a miner running LuxOS firmware is suspected of overheating, use the following steps to investigate the LuxOS logs:

ℹ️

Note: These steps need to be completed before the miner in question is rebooted.

S21 Chip Overheating

Step 1: Identify Power Off Events

Search for "PowerOff" within the LuxOS log files. These entries mark the point at which the hashboards were shut down, and the lines around them indicate why. Perform this step even if the initial cause appears unrelated to overheating.

2024-04-18T20:02:24.561941Z DEBUG ThreadId(10) luxminer::watcher::bus: PowerOff: hashboard_id=1
2024-04-18T20:02:24.561996Z DEBUG ThreadId(10) luxminer::watcher::bus: NoAutoReboot: hashboard_id=1
2024-04-18T20:02:24.562001Z WARN ThreadId(19) luxminer::asic::reboot: System completely powered off
2024-04-18T20:02:24.562018Z DEBUG ThreadId(10) luxminer::watcher::bus: PowerOff: hashboard_id=2
2024-04-18T20:02:24.562037Z DEBUG ThreadId(10) luxminer::watcher::bus: NoAutoReboot: hashboard_id=2
2024-04-18T20:02:24.562057Z DEBUG ThreadId(10) luxminer::watcher::bus: PowerOff: hashboard_id=0
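
The search in Step 1 can also be scripted. The following is a minimal sketch in Python, assuming the log has been saved as plain text and that PowerOff entries follow the layout of the sample lines above. The function name and the regular expression are illustrative and are not part of LuxOS.

```python
import re

# Pattern inferred from the sample PowerOff lines above (an assumption);
# other LuxOS versions may format these entries differently.
POWER_OFF = re.compile(
    r"^(?P<ts>\S+)\s+\w+\s+.*\bPowerOff: hashboard_id=(?P<board>\d+)"
)


def find_power_off_events(lines):
    """Return (timestamp, hashboard_id) for each PowerOff entry, in log order."""
    events = []
    for line in lines:
        match = POWER_OFF.search(line.strip())
        if match:
            events.append((match.group("ts"), int(match.group("board"))))
    return events


# Sample lines copied from the log excerpt above.
sample_lines = [
    "2024-04-18T20:02:24.561941Z DEBUG ThreadId(10) luxminer::watcher::bus: PowerOff: hashboard_id=1",
    "2024-04-18T20:02:24.561996Z DEBUG ThreadId(10) luxminer::watcher::bus: NoAutoReboot: hashboard_id=1",
    "2024-04-18T20:02:24.562001Z WARN ThreadId(19) luxminer::asic::reboot: System completely powered off",
    "2024-04-18T20:02:24.562018Z DEBUG ThreadId(10) luxminer::watcher::bus: PowerOff: hashboard_id=2",
    "2024-04-18T20:02:24.562037Z DEBUG ThreadId(10) luxminer::watcher::bus: NoAutoReboot: hashboard_id=2",
    "2024-04-18T20:02:24.562057Z DEBUG ThreadId(10) luxminer::watcher::bus: PowerOff: hashboard_id=0",
]

for timestamp, board in find_power_off_events(sample_lines):
    print(f"{timestamp} hashboard {board} powered off")
```

The earliest timestamp returned is a reasonable starting point for Step 2, since the entries explaining the shutdown are expected just before it.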

Step 2: Check for High Temperature Warnings

Immediately preceding the power-off logs, look for messages indicating high chip die temperatures:

2024-04-18T20:02:24.537009Z WARN ThreadId(16) luxminer::fan: Chip die temp exceeds 100C! board=1, chip=27, temperature=100.56916999999999C

This warning suggests either a malfunctioning chip reporting incorrect temperatures or an actual overheating event.
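
To check whether chip temperature warnings were logged just before the shutdown, the two kinds of entries can be correlated by timestamp. The sketch below assumes the timestamp and message layouts seen in the sample lines. The 5-second window, the function name, and the patterns are illustrative assumptions, not LuxOS-defined values.

```python
import re
from datetime import datetime, timedelta

# Timestamp layout inferred from the sample log lines (an assumption).
TS_FORMAT = "%Y-%m-%dT%H:%M:%S.%fZ"

# Patterns inferred from the sample lines; adjust if your logs differ.
CHIP_TEMP = re.compile(
    r"^(?P<ts>\S+)\s+WARN\s+.*Chip die temp exceeds \d+C! "
    r"board=(?P<board>\d+), chip=(?P<chip>\d+), temperature=(?P<temp>[\d.]+)C"
)
POWER_OFF = re.compile(r"^(?P<ts>\S+)\s+\w+\s+.*\bPowerOff: hashboard_id=\d+")


def chip_warnings_before_power_off(lines, window_seconds=5.0):
    """Return chip temperature warnings logged within the window before the first PowerOff."""
    first_off = None
    warnings = []
    for raw in lines:
        line = raw.strip()
        off = POWER_OFF.search(line)
        if off and first_off is None:
            first_off = datetime.strptime(off.group("ts"), TS_FORMAT)
        warn = CHIP_TEMP.search(line)
        if warn:
            warnings.append({
                "time": datetime.strptime(warn.group("ts"), TS_FORMAT),
                "board": int(warn.group("board")),
                "chip": int(warn.group("chip")),
                "temperature_c": float(warn.group("temp")),
            })
    if first_off is None:
        return []
    window = timedelta(seconds=window_seconds)
    # Keep only warnings at or before the shutdown, and no older than the window.
    return [w for w in warnings if timedelta(0) <= first_off - w["time"] <= window]


# Sample lines copied from the excerpts above.
sample_lines = [
    "2024-04-18T20:02:24.537009Z WARN ThreadId(16) luxminer::fan: Chip die temp exceeds 100C! board=1, chip=27, temperature=100.56916999999999C",
    "2024-04-18T20:02:24.561941Z DEBUG ThreadId(10) luxminer::watcher::bus: PowerOff: hashboard_id=1",
]

for w in chip_warnings_before_power_off(sample_lines):
    print(f"board {w['board']} chip {w['chip']}: {w['temperature_c']:.1f} C")
```

If the same board and chip appear in every warning while neighboring chips stay cool, that may point toward a faulty sensor reading rather than a real overheating event, but the logs alone do not settle the question.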

S19 Overheating

Step 1: Identify Power Off Events

Search for "poweroff" within the LuxOS log files, ideally case-insensitively so that the "PowerOff" entries are matched as well. The matching entries indicate why the hashboards were shut down. Perform this step even if the initial cause appears unrelated to overheating.

2024-04-19T20:26:46.700004Z ERROR ThreadId(10) luxminer::fan: sent poweroff signal to hashboard due to high temperature! hashboard_id=2 temperature=73.0 limit=73.0
2024-04-19T20:26:46.700448Z DEBUG ThreadId(07) luxminer::watcher::bus: RebootBoard: hashboard_id=2, reason=HighTemperature
2024-04-19T20:26:46.780706Z  WARN ThreadId(12) luxminer::asic::reboot: Rebooting board; automatic start scheduled [stage2, 1/5 - temperature too high] hashboard_id=2
2024-04-19T20:26:46.781006Z DEBUG ThreadId(07) luxminer::watcher::bus: PowerOff: hashboard_id=2

In this example from an S19, hashboard 2 is shut down because it reached the board temperature limit of 73 °C. This indicates an actual heat or airflow problem with the miner itself.
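
The board temperature and limit can be pulled out of these entries programmatically. This is a minimal sketch, assuming the message wording matches the sample line above. The function name and pattern are illustrative and are not part of LuxOS.

```python
import re

# Pattern inferred from the sample "sent poweroff signal" line (an assumption).
HIGH_TEMP_POWEROFF = re.compile(
    r"^(?P<ts>\S+)\s+ERROR\s+.*sent poweroff signal to hashboard due to high temperature! "
    r"hashboard_id=(?P<board>\d+) temperature=(?P<temp>[\d.]+) limit=(?P<limit>[\d.]+)"
)


def find_high_temp_shutdowns(lines):
    """Return (timestamp, hashboard_id, temperature, limit) for each high-temperature poweroff."""
    shutdowns = []
    for line in lines:
        match = HIGH_TEMP_POWEROFF.search(line.strip())
        if match:
            shutdowns.append((
                match.group("ts"),
                int(match.group("board")),
                float(match.group("temp")),
                float(match.group("limit")),
            ))
    return shutdowns


# Sample lines copied from the S19 excerpt above.
sample_lines = [
    "2024-04-19T20:26:46.700004Z ERROR ThreadId(10) luxminer::fan: sent poweroff signal to hashboard due to high temperature! hashboard_id=2 temperature=73.0 limit=73.0",
    "2024-04-19T20:26:46.700448Z DEBUG ThreadId(07) luxminer::watcher::bus: RebootBoard: hashboard_id=2, reason=HighTemperature",
]

for ts, board, temp, limit in find_high_temp_shutdowns(sample_lines):
    print(f"{ts} hashboard {board}: {temp} C (limit {limit} C)")
```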

ℹ️

Note: These are not the only indicators of overheating issues in the logs. If additional assistance is required, please contact Luxor Firmware Support.
