mirror of
https://mirrors.bfsu.edu.cn/git/linux.git
synced 2024-12-13 22:14:20 +08:00
habanalabs: no consecutive err when user context is enabled
Consecutive error protects a device reset loop from being triggered due to h/w issues and enters the device into an unavailable state. When user may cause the error, an unavailable state will prevent the user from running its workloads. The commit prevents entering consecutive state when a user context is enabled. Signed-off-by: Tal Cohen <talcohen@habana.ai> Reviewed-by: Oded Gabbay <ogabbay@kernel.org> Signed-off-by: Oded Gabbay <ogabbay@kernel.org>
This commit is contained in:
parent
1b363adc7f
commit
4a9c6e2cdf
@ -1320,6 +1320,10 @@ static void handle_reset_trigger(struct hl_device *hdev, u32 flags)
|
||||
{
|
||||
u32 cur_reset_trigger = HL_RESET_TRIGGER_DEFAULT;
|
||||
|
||||
/* No consecutive mechanism when user context exists */
|
||||
if (hdev->is_compute_ctx_active)
|
||||
return;
|
||||
|
||||
/*
|
||||
* 'reset cause' is being updated here, because getting here
|
||||
* means that it's the 1st time and the last time we're here
|
||||
|
Loading…
Reference in New Issue
Block a user