On Monday, 13 November 2017 15:12:56 GMT Daniel Frey wrote:
> On 11/13/17 02:59, Peter Humphrey wrote:
> > Hello list,
> >
> > I'm hunting a problem with cooling in this box, and I've got as far as
> > suspecting my new AMD WX 5100 GPU.
> >
> > One of my BOINC projects causes the GPU temperature, as shown by
> > gkrellm, to shoot up to 75C or more and cause intolerable system
> > cooling noise. If I suspend that project but leave the other seven
> > running, the temperature returns to what I hope is a normal 55C. Those
> > seven projects are supposed to use the GPU, but I'm not sure whether
> > they do in fact.
> >
> > Is there any way I can monitor what is using the GPU, to find out?
>
> I don't know if there's a utility for consumer level cards that can do
> this. I do remember for Nvidia there's nvidia-smi but I don't think it
> will list processes for desktop cards.

This isn't consumer grade (look it up in your local shops ;-) ):

# lspci -v -s 01:00.0
01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon Pro WX 5100] (prog-if 00 [VGA controller])
        Subsystem: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon Pro WX 5100]
        Flags: bus master, fast devsel, latency 0, IRQ 34, NUMA node 0
        Memory at c0000000 (64-bit, prefetchable) [size=256M]
        Memory at d0000000 (64-bit, prefetchable) [size=2M]
        I/O ports at e000 [size=256]
        Memory at fbe00000 (32-bit, non-prefetchable) [size=256K]
        Expansion ROM at 000c0000 [disabled] [size=128K]
        Capabilities: [48] Vendor Specific Information: Len=08 <?>
        Capabilities: [50] Power Management version 3
        Capabilities: [58] Express Legacy Endpoint, MSI 00
        Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+
        Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 Len=010 <?>
        Capabilities: [150] Advanced Error Reporting
        Capabilities: [200] #15
        Capabilities: [270] #19
        Capabilities: [2b0] Address Translation Service (ATS)
        Capabilities: [2c0] Page Request Interface (PRI)
        Capabilities: [2d0] Process Address Space ID (PASID)
        Capabilities: [320] Latency Tolerance Reporting
        Capabilities: [328] Alternative Routing-ID Interpretation (ARI)
        Capabilities: [370] L1 PM Substates
        Kernel driver in use: amdgpu
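
Since the kernel driver is amdgpu, the temperature that gkrellm displays could probably also be logged from a script while individual BOINC projects are suspended and resumed. The following is only a sketch, assuming the card shows up as card0 and that the driver exposes its sensor as temp1_input (in millidegrees Celsius) under the device's hwmon directory. The function name and the default path are illustrative and may need adjusting on a given system.

```python
import glob


def gpu_temp_celsius(device_dir="/sys/class/drm/card0/device"):
    """Return the first hwmon temperature found under device_dir, in degrees C.

    Returns None if no readable temp1_input file is found.
    The default device_dir is an assumption (first DRM card).
    """
    pattern = device_dir + "/hwmon/hwmon*/temp1_input"
    for path in sorted(glob.glob(pattern)):
        try:
            with open(path) as f:
                # hwmon sensors report millidegrees Celsius as an integer
                return int(f.read().strip()) / 1000.0
        except (OSError, ValueError):
            continue  # unreadable or malformed; try the next hwmon entry
    return None
```

Calling gpu_temp_celsius() once a second and printing the result with a timestamp would give a log to compare against the times at which each project was suspended.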

> The only other generic ones I can think of are cuda-z and gputop. Have
> you tried one of those? Although I don't think it'll give you the
> information you need either.

As it's AMD, not Nvidia, nvidia-smi and cuda-z aren't suitable. I hadn't heard
of gputop - thanks. I'll have a look at it.

I forgot to add that I'm using the proprietary dev-libs/amdgpu-pro-opencl
because mesa hasn't caught up yet.
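
One generic way to approach the original question, independent of any vendor tool, is to look for processes that hold a GPU device node open. The sketch below walks /proc and reports every process with an open file descriptor pointing at such a node. It rests on assumptions: that the OpenCL stack opens nodes under /dev/dri/ or /dev/kfd (the prefixes are guesses and may differ here), and that it is run as root so other users' fd directories are readable. The function name gpu_users is made up for illustration. Note that an open device node only shows that a process has attached to the GPU, not how hard it is working it, so the X server will appear in the list as well.

```python
import os


def gpu_users(proc_root="/proc", prefixes=("/dev/dri/", "/dev/kfd")):
    """Map pid -> (command name, sorted list of GPU device nodes held open).

    prefixes lists the device paths treated as "the GPU"; these are
    assumptions and may need changing for a particular driver stack.
    """
    users = {}
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # only numeric directories are processes
        fd_dir = os.path.join(proc_root, entry, "fd")
        try:
            fds = os.listdir(fd_dir)
        except OSError:
            continue  # process has exited, or we lack permission
        nodes = set()
        for fd in fds:
            try:
                target = os.readlink(os.path.join(fd_dir, fd))
            except OSError:
                continue  # descriptor closed while we were looking
            if target.startswith(prefixes):
                nodes.add(target)
        if nodes:
            try:
                with open(os.path.join(proc_root, entry, "comm")) as f:
                    name = f.read().strip()
            except OSError:
                name = "?"
            users[int(entry)] = (name, sorted(nodes))
    return users
```

Printing the items of gpu_users() while all eight projects are running, and comparing the process names with the BOINC task list, should show which of them have attached to the card at all.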

--
Regards,
Peter.