Commits · 0d4a2c5767dc6136079b11ed45934143d309026e · Kirill Smelkov / linux

18 May, 2018 40 commits

drm/nouveau/kms: move display class instantiation to library · 0d4a2c57
Ben Skeggs authored May 08, 2018
```
This function is useful outside of DRM code.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
0d4a2c57
drm/nouveau/drm/nv50-: remove allocation of sw class · 512fa0b8
Ben Skeggs authored May 08, 2018
```
Hasn't been required for a long time.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
512fa0b8
drm/nouveau: no need to create ctxdma for push buffers on fermi and up · 92b4eaaf
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
92b4eaaf

drm/nouveau: remove fence wait code from deferred client work handler · 11e451e7

Ben Skeggs authored May 08, 2018

Fences attached to deferred client work items now originate from channels
belonging to the client, meaning we can be certain they've been signalled
before we destroy a client.

This closes a race that could happen if the dma_fence_wait_timeout() call
didn't succeed.  When the fence was later signalled, a use-after-free was
possible.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

11e451e7

drm/nouveau/gem: tie deferred unmapping of buffers to VMA fence completion · 470db8b7

Ben Skeggs authored May 08, 2018

As VMAs are per-client, unlike buffers, this allows us to avoid referencing
foreign fences (those that belong to another client/driver) from the client
deferred work handler, and prevent some not-fun race conditions that can be
triggered when a fence stalls.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

470db8b7

drm/nouveau/gem: attach fences to VMAs to track GPU usage · 0db912af

Ben Skeggs authored May 08, 2018

An upcoming patch will use these to fix issues related to the deferred
unmapping of GEM objects.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

0db912af

drm/nouveau/gem: lookup VMAs for buffers referenced by pushbuf ioctl · 19ca10d8

Ben Skeggs authored May 08, 2018

We previously only did this for push buffers, but an upcoming patch will
need to attach fences to all VMAs to resolve another issue.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

19ca10d8

drm/nouveau/gr/gp102-: setup stencil zbc · 4b2c71ed
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
4b2c71ed

drm/nouveau/gr/gp100-: use correct registers for zbc colour/depth setup · e9d03335

Ben Skeggs authored May 08, 2018

These were missed the first time around due to the driver version I traced
using the older registers still.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

e9d03335

drm/nouveau/gr/gp100-: fix attrib cb setup · 7a058a90
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
7a058a90
drm/nouveau/gr/gp100-: fix pagepool setup · 17f2d4df
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
17f2d4df

drm/nouveau/gr/gf100-gm10x: update register lists · 191e3232

Ben Skeggs authored May 08, 2018

There are differences on GM200 and newer too, but we can't fix them there
as they come from firmware packages.

A request has been made to NVIDIA to release updated firmware.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

191e3232

drm/nouveau/gr/gf100-: swap bundle and pagepool · 6f023332
Ben Skeggs authored May 08, 2018
```
Makes it easier to diff against RM traces.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
6f023332

drm/nouveau/gr/gf100-: calculate and use sm mapping table · 068cae74

Ben Skeggs authored May 08, 2018

There's a number of places that require this data, so let's separate out
the calculations to ensure they remain consistent.

This is incorrect for GM200 and newer, but will produce the same results
as we did before.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

068cae74

drm/nouveau/gr/gf100-: port zcull tile mapping calculations from NVGPU · d00ffc0c
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
d00ffc0c

drm/nouveau/gr/gf100-: port tile mapping calculations from NVGPU · 5f6474a4

Ben Skeggs authored May 08, 2018

There's also a couple of hardcoded tables for a couple of very specific
configurations that NVGPU's algorithm didn't work for.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

5f6474a4

drm/nouveau/gr/gf100-: virtualise trap_mp · 5c05a589
Ben Skeggs authored May 08, 2018
```
Required to support Volta.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
5c05a589
drm/nouveau/gr/gf100-: add missing reset sequence before golden context init · 74b6068b
Ben Skeggs authored May 08, 2018
```
RM and NVGPU both have a variant of this, we probably should too.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
74b6068b
drm/nouveau/gr/gf100-: delete duplicated grctx init code · 201ed6f6
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
201ed6f6
drm/nouveau/gr/gf100-: update r408840 where required · a5537f98
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
a5537f98
drm/nouveau/gr/gf100-: update 419a3c where required · 8d56fc48
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
8d56fc48
drm/nouveau/gr/gf100-: virtualise r418e94 · c2592ade
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
c2592ade
drm/nouveau/gr/gf100-: virtualise r419e00 · 18d17221
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
18d17221
drm/nouveau/gr/gf100-: update 419eb0 where required · ad45a92b
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
ad45a92b
drm/nouveau/gr/gf100-: note missing 418800 modifications · 5b54b5b9
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
5b54b5b9
drm/nouveau/gr/gf100-gf119: update 419cb8 where required · 99a3c67e
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
99a3c67e
drm/nouveau/gr/gf100-: support firmware-provided bundle/method everywhere · 0e5a5e86
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
0e5a5e86

drm/nouveau/gr/gf100-: virtualise tpc_mask + apply fixes from traces · fc360764

Ben Skeggs authored May 08, 2018

We weren't placing higher TPC IDs in the right place on some configurations.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

fc360764

drm/nouveau/gr/gf100-: virtualise r419f78 + apply fixes from traces · aa5e38dc

Ben Skeggs authored May 08, 2018

Removed from GK110[B]/GK208 as RM traces show it not being touched.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

aa5e38dc

drm/nouveau/gr/gf100-: virtualise gpc_tpc_nr · 60c0264a
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
60c0264a
drm/nouveau/gr/gf100-: virtualise r406500 · e7163b19
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
e7163b19

drm/nouveau/gr/gf100-: virtualise dist_skip_table + improve algorithm · 60770fa2

Ben Skeggs authored May 08, 2018

The algorithm for GM200 and newer matches RM for all the boards I have, but
I don't have enough data to try and figure something out for earlier boards,
so these will still write zeroes to the table as we did before.

The code in NVGPU isn't helpful here, it appears to handle specific cases.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

60770fa2

drm/nouveau/gr/gf100-gf119: modify max_ways_evict where required · c4a2b638

Ben Skeggs authored May 08, 2018

I don't think this is done after Fermi, NVGPU used to do it but removed
the code, and I've not seen RM traces touching it either.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

c4a2b638

drm/nouveau/gr/gf100-: virtualise alpha_beta_tables + improve algorithms · 43952c6f

Ben Skeggs authored May 08, 2018

I haven't yet been able to find a fully programatic way of calculating the
same mapping as NVIDIA for GF100-GF119, so the algorithm partially depends
on data tables for specific configurations.

I couldn't find traces for every possibility, so the algorithm will switch
to a mapping similar to what GK104-GM10x use if it encounters one. We did
the wrong thing before anyway, so shouldn't matter too much.

The algorithm used in the GK104 implementation was ported from NVGPU.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

43952c6f

drm/nouveau/gr/gf100-: virtualise rop_mapping · ff209c23
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
ff209c23
drm/nouveau/gr/gf100-: virtualise r4060a8 + apply fixes from traces · 9d8a80df
Ben Skeggs authored May 08, 2018
```
Also fixes some GPUs where we write too many registers.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
9d8a80df

drm/nouveau/gr/gf100-: virtualise tpc_per_gpc · e51f75d5

Ben Skeggs authored May 08, 2018

GM20B now also shares the same code, as NVGPU shows it doesn't need
special treatment.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

e51f75d5

drm/nouveau/gr/gf100-: virtualise sm_id/tpc_nr · fc740f54
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
fc740f54
drm/nouveau/gr/gf100-: virtualise patch_ltc, noting missing init · ea4a2bb5
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
ea4a2bb5
drm/nouveau/gr/gf100-: support firmware-provided sw_ctx everywhere · aedc49fd
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
aedc49fd