Gentoo Archives: gentoo-user

From: Peter Humphrey <peter@××××××××××××.uk>
To: gentoo-user@l.g.o
Subject: Re: [gentoo-user] Instrumenting the GPU
Date: Mon, 13 Nov 2017 15:46:11
Message-Id: 2924575.R0oEocLxqo@peak
In Reply to: Re: [gentoo-user] Instrumenting the GPU by Daniel Frey
1 On Monday, 13 November 2017 15:12:56 GMT Daniel Frey wrote:
2 > On 11/13/17 02:59, Peter Humphrey wrote:
3 > > Hello list,
4 > >
5 > > I'm hunting a problem with cooling in this box, and I've got as far as
6 > > suspecting my new AMD WX 5100 GPU.
7 > >
8 > > One of my BOINC projects causes the GPU temperature, as shown by
9 > > gkrellm, to shoot up to 75C or more and cause intolerable system
10 > > cooling noise. If I suspend that project but leave the other seven
11 > > running, the temperature returns to what I hope is a normal 55C. Those
12 > > seven projects are supposed to use the GPU, but I'm not sure whether
13 > > they do in fact.
14 > >
15 > > Is there any way I can monitor what is using the GPU, to find out?
16 >
17 > I don't know if there's a utility for consumer level cards that can do
18 > this. I do remember for Nvidia there's nvidia-smi but I don't think it
19 > will list processes for desktop cards.
20
21 This isn't consumer grade (look it up in your local shops ;-) ):
22
23 # lspci -v -s 01:00.0
24 01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI]
25 Ellesmere [Radeon Pro WX 5100] (prog-if 00 [VGA controller])
26 Subsystem: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon
27 Pro WX 5100]
28 Flags: bus master, fast devsel, latency 0, IRQ 34, NUMA node 0
29 Memory at c0000000 (64-bit, prefetchable) [size=256M]
30 Memory at d0000000 (64-bit, prefetchable) [size=2M]
31 I/O ports at e000 [size=256]
32 Memory at fbe00000 (32-bit, non-prefetchable) [size=256K]
33 Expansion ROM at 000c0000 [disabled] [size=128K]
34 Capabilities: [48] Vendor Specific Information: Len=08 <?>
35 Capabilities: [50] Power Management version 3
36 Capabilities: [58] Express Legacy Endpoint, MSI 00
37 Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+
38 Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1
39 Len=010 <?>
40 Capabilities: [150] Advanced Error Reporting
41 Capabilities: [200] #15
42 Capabilities: [270] #19
43 Capabilities: [2b0] Address Translation Service (ATS)
44 Capabilities: [2c0] Page Request Interface (PRI)
45 Capabilities: [2d0] Process Address Space ID (PASID)
46 Capabilities: [320] Latency Tolerance Reporting
47 Capabilities: [328] Alternative Routing-ID Interpretation (ARI)
48 Capabilities: [370] L1 PM Substates
49 Kernel driver in use: amdgpu
50
51 > The only other generic ones I can think of are cuda-z and gputop. Have
52 > you tried one of those? Although I don't think it'll give you the
53 > information you need either.
54
55 As it's AMD, not nVidia, nvidia-smi and cuda aren't suitable. I hadn't heard
56 of GPU Top - thanks. I'll have a look at it.
57
58 I forgot to add that I'm using the proprietary dev-libs/amdgpu-pro-opencl
59 because mesa hasn't caught up yet.
60
61 --
62 Regards,
63 Peter.

Replies

Subject Author
Re: [gentoo-user] Instrumenting the GPU R0b0t1 <r030t1@×××××.com>