    accel/habanalabs/gaudi2: get the correct QM CQ info upon an error · ae303d88
    Tomer Tayar authored
    Upon a QM error, the address/size from both the CQ and the ARC_CQ are
    printed, although the instruction that led to the error was received
    from only one of them.
    
    Moreover, in case of a QM undefined opcode, only one of these
    address/size sets is captured, based on the value of ARC_CQ_PTR.
    However, this value can be non-zero even if the CQ is currently in use,
    when the CQ and ARC_CQ are used alternately.
    
    Assuming a stop-on-error configuration, use the CP_STS.CUR_CQ field to
    get the relevant CQ for the QM error.
    
    Signed-off-by: Tomer Tayar <ttayar@habana.ai>
    Reviewed-by: Oded Gabbay <ogabbay@kernel.org>
    Signed-off-by: Oded Gabbay <ogabbay@kernel.org>
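
The selection logic described in the commit message can be sketched roughly as below. This is a standalone illustration, not the driver code: the struct names, the `qm_get_faulting_cq()` helper, the bit position of `CP_STS_CUR_CQ_MASK`, and the convention that a set CUR_CQ bit means "ARC_CQ" are all assumptions made for the example. The real mask and register offsets live in the driver's register headers.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical bit position of CUR_CQ inside CP_STS (assumption). */
#define CP_STS_CUR_CQ_MASK (UINT32_C(1) << 4)

/* Hypothetical snapshot of the QM registers read after an error. */
struct qm_regs_snapshot {
	uint32_t cp_sts;       /* CP status, holds the CUR_CQ field */
	uint64_t cq_ptr;       /* address last fetched via the CQ */
	uint32_t cq_tsize;     /* size last fetched via the CQ */
	uint64_t arc_cq_ptr;   /* address last fetched via the ARC_CQ */
	uint32_t arc_cq_tsize; /* size last fetched via the ARC_CQ */
};

/* Address/size of the one CQ that fed the faulting instruction. */
struct qm_cq_info {
	bool is_arc_cq;
	uint64_t addr;
	uint32_t size;
};

/*
 * Pick the CQ by CP_STS.CUR_CQ rather than by "ARC_CQ_PTR != 0".
 * Assumes stop-on-error, so CP_STS still describes the faulting
 * instruction, and assumes CUR_CQ == 1 means the ARC_CQ.
 */
static struct qm_cq_info qm_get_faulting_cq(const struct qm_regs_snapshot *regs)
{
	struct qm_cq_info info;

	info.is_arc_cq = (regs->cp_sts & CP_STS_CUR_CQ_MASK) != 0;
	if (info.is_arc_cq) {
		info.addr = regs->arc_cq_ptr;
		info.size = regs->arc_cq_tsize;
	} else {
		info.addr = regs->cq_ptr;
		info.size = regs->cq_tsize;
	}

	return info;
}
```

With this approach, a stale non-zero ARC_CQ pointer left over from earlier alternating use does not affect the result, since only the CUR_CQ bit decides which address/size pair is reported.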
gaudi2_regs.h 25.6 KB