    Merge tag 'edac_updates_for_v6.9' of git://git.kernel.org/pub/scm/linux/kernel/git/ras/ras · b0402403
    Linus Torvalds authored
    Pull EDAC updates from Borislav Petkov:
    
     - Add a FRU (Field Replaceable Unit) memory poison manager which
       collects and manages previously encountered hw errors in order to
       save them to persistent storage across reboots. Previously recorded
       errors are "replayed" upon reboot in order to poison memory which has
       caused said errors in the past.
    
       The main use case is stacked, on-chip memory which cannot simply be
       replaced so poisoning faulty areas of it and thus making them
       inaccessible is the only strategy to prolong its lifetime.
    
     - Add an AMD address translation library glue which converts the
       reported addresses of hw errors into system physical addresses in
       order to be used by other subsystems like memory failure, for
       example. Add support for MI300 accelerators to that library.
    
     - igen6: Add support for Alder Lake-N SoC
    
     - i10nm: Add Grand Ridge support
    
     - The usual fixlets and cleanups
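
    The record-and-replay idea behind the FRU memory poison manager in the
    first item can be illustrated with a small userspace toy model. This is
    only a sketch and not the kernel implementation: the PoisonManager class,
    the JSON file standing in for persistent storage, and the "fru0"
    identifier are all hypothetical names chosen for illustration.

```python
import json
import os
import tempfile


class PoisonManager:
    """Toy model: remember bad addresses per FRU and replay them after a restart."""

    def __init__(self, store_path):
        self.store_path = store_path  # hypothetical stand-in for persistent storage
        self.poisoned = set()         # addresses made inaccessible in this "boot"
        self.records = {}             # FRU id -> list of recorded bad addresses

    def load_and_replay(self):
        """On "boot": read saved records and poison every recorded address again."""
        if os.path.exists(self.store_path):
            with open(self.store_path) as f:
                self.records = json.load(f)
        for addrs in self.records.values():
            self.poisoned.update(addrs)

    def report_error(self, fru_id, addr):
        """On a new error: poison the address and persist the updated record."""
        self.poisoned.add(addr)
        addrs = self.records.setdefault(fru_id, [])
        if addr not in addrs:
            addrs.append(addr)
        with open(self.store_path, "w") as f:
            json.dump(self.records, f)


def demo():
    """Record two errors, simulate a reboot, and return what gets re-poisoned."""
    with tempfile.TemporaryDirectory() as d:
        path = os.path.join(d, "fru_records.json")
        first = PoisonManager(path)
        first.load_and_replay()            # nothing saved yet
        first.report_error("fru0", 0x1000)
        first.report_error("fru0", 0x2000)
        second = PoisonManager(path)       # fresh instance models a reboot
        second.load_and_replay()
        return sorted(second.poisoned)
```

    Calling demo() shows that a fresh instance, which has seen no errors
    itself, still ends up with both previously recorded addresses poisoned
    once it has replayed the saved records.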
    
    * tag 'edac_updates_for_v6.9' of git://git.kernel.org/pub/scm/linux/kernel/git/ras/ras:
      EDAC/versal: Convert to platform remove callback returning void
      RAS/AMD/FMPM: Fix off by one when unwinding on error
      RAS/AMD/FMPM: Add debugfs interface to print record entries
      RAS/AMD/FMPM: Save SPA values
      RAS: Export helper to get ras_debugfs_dir
      RAS/AMD/ATL: Fix bit overflow in denorm_addr_df4_np2()
      RAS: Introduce a FRU memory poison manager
      RAS/AMD/ATL: Add MI300 row retirement support
      Documentation: Move RAS section to admin-guide
      EDAC/versal: Make the bit position of injected errors configurable
      EDAC/i10nm: Add Intel Grand Ridge micro-server support
      EDAC/igen6: Add one more Intel Alder Lake-N SoC support
      RAS/AMD/ATL: Add MI300 DRAM to normalized address translation support
      RAS/AMD/ATL: Fix array overflow in get_logical_coh_st_fabric_id_mi300()
      RAS/AMD/ATL: Add MI300 support
      Documentation: RAS: Add index and address translation section
      EDAC/amd64: Use new AMD Address Translation Library
      RAS: Introduce AMD Address Translation Library
      EDAC/synopsys: Convert to devm_platform_ioremap_resource()