1. 11 Mar, 2016 2 commits
    • Christophe Leroy's avatar
      powerpc/8xx: Map linear kernel RAM with 8M pages · a372acfa
      Christophe Leroy authored
      On a live running system (VoIP gateway for Air Trafic Control), over
      a 10 minutes period (with 277s idle), we get 87 millions DTLB misses
      and approximatly 35 secondes are spent in DTLB handler.
      This represents 5.8% of the overall time and even 10.8% of the
      non-idle time.
      Among those 87 millions DTLB misses, 15% are on user addresses and
      85% are on kernel addresses. And within the kernel addresses, 93%
      are on addresses from the linear address space and only 7% are on
      addresses from the virtual address space.
      
      MPC8xx has no BATs but it has 8Mb page size. This patch implements
      mapping of kernel RAM using 8Mb pages, on the same model as what is
      done on the 40x.
      
      In 4k pages mode, each PGD entry maps a 4Mb area: we map every two
      entries to the same 8Mb physical page. In each second entry, we add
      4Mb to the page physical address to ease life of the FixupDAR
      routine. This is just ignored by HW.
      
      In 16k pages mode, each PGD entry maps a 64Mb area: each PGD entry
      will point to the first page of the area. The DTLB handler adds
      the 3 bits from EPN to map the correct page.
      
      With this patch applied, we now get only 13 millions TLB misses
      during the 10 minutes period. The idle time has increased to 313s
      and the overall time spent in DTLB miss handler is 6.3s, which
      represents 1% of the overall time and 2.2% of non-idle time.
      Signed-off-by: default avatarChristophe Leroy <christophe.leroy@c-s.fr>
      Signed-off-by: default avatarScott Wood <oss@buserror.net>
      a372acfa
    • Christophe Leroy's avatar
      powerpc/8xx: Save r3 all the time in DTLB miss handler · 913a6b3d
      Christophe Leroy authored
      We are spending between 40 and 160 cycles with a mean of 65 cycles in
      the DTLB handling routine (measured with mftbl) so make it more
      simple althought it adds one instruction.
      With this modification, we get three registers available at all time,
      which will help with following patch.
      Signed-off-by: default avatarChristophe Leroy <christophe.leroy@c-s.fr>
      Signed-off-by: default avatarScott Wood <oss@buserror.net>
      913a6b3d
  2. 09 Mar, 2016 12 commits
  3. 05 Mar, 2016 16 commits
  4. 03 Mar, 2016 8 commits
  5. 02 Mar, 2016 2 commits