Gentoo Archives: gentoo-user

From: Pengcheng Xu <i@××××××××.moe>
To: gentoo-user@l.g.o
Subject: RE: [gentoo-user] New PC hangs/lacks ?
Date: Wed, 06 May 2020 15:26:43
Message-Id: 000e01d623ba$b93f2220$2bbd6660$@jsteward.moe
In Reply to: Re: [gentoo-user] New PC hangs/lacks ? by tuxic@posteo.de
1 Well, I think you should be able to set up some real-time monitoring that keeps collecting system status in the background and make some plot to observe if something's going wrong during the Blender workload. An easy setup would be to fire up a Prometheus and Grafana docker app, and to run some node_exporters to collect metrics on your machine in question. You may miss chances to reproduce the issue when you're looking out for them, but the monitoring system would catch it when you're not watching and things do go wrong.
2
3 Also wanted to say that sys-apps/dstat seems to be a good quick glance over what's happening in the system on different aspects in a single place. It does this nice printing to the terminal once a second:
4
5 2 1 97 0 0| 0 8192B| 292B 16k| 0 0 | 684 1662
6 2 2 96 0 0| 0 0 | 346B 17k| 0 0 | 690 1641
7 --total-cpu-usage-- -dsk/total- -net/total- ---paging-- ---system--
8 usr sys idl wai stl| read writ| recv send| in out | int csw
9 2 1 97 0 0| 0 0 | 560B 17k| 0 0 | 696 1630
10 4 3 93 0 0| 0 0 |1060B 34k| 0 0 |1368 3231
11 2 1 97 0 0| 0 0 |1112B 18k| 0 0 | 699 1675
12
13 It's easy to overlook things when you're juggling with multiple monitoring apps like htop/iotop/... while having to manage the real workload simultaneously. Maybe try it out when you're dealing with system activity problems next time.
14
15 Regards,
16 --
17 Pengcheng Xu
18 https://jsteward.moe
19
20 > -----Original Message-----
21 > From: tuxic@××××××.de <tuxic@××××××.de>
22 > Sent: Wednesday, May 6, 2020 11:10 PM
23 > To: gentoo-user@l.g.o
24 > Subject: Re: [gentoo-user] New PC hangs/lacks ?
25 >
26 > On 05/06 04:19, tuxic@××××××.de wrote:
27 > > On 05/06 07:07, Mark Knecht wrote:
28 > > > On Wed, May 6, 2020 at 3:21 AM <tuxic@××××××.de> wrote:
29 > > > >
30 > > > >
31 > > > > Hi,
32 > > > >
33 > > > > while rendering with Blender the system performance (especially
34 > > > > graphic related stuff) lacks. That's not nice but it seems that
35 > > > > this is the way it is designed.
36 > > > >
37 > > > > What makes me a little nervous are freezes of several seconds. It
38 > > > > not onlu freezes but the whole graphical interface of everything
39 > > > > locks down (I couldn't find a corona computer virus, though).
40 > > > >
41 > > > > In the Xorg log I found this:
42 > > > >
43 > > > > [ 2808.761] (WW) NVIDIA: Wait for channel idle timed out.
44 > > > >
45 > > > > which possibly match such a moment of a freeze.
46 > > > >
47 > > > > My setup:
48 > > > > Blender 2.90a (alpha) and Blender 2.83 (beta) and Blender 2.82a
49 > > > > (stable).
50 > > > > All Blender versions show the same problem.
51 > > > >
52 > > > > X11/Openbox
53 > > > >
54 > > > > NVidia 484.82 as delivered by NVidia, since the Gentoo package
55 > > > > does not install all files of the driver which are needed for
56 > > > > Blender (for example to support Optix).
57 > > > >
58 > > > > No other application, which heavily uses the GPU was running at
59 > > > > that time.
60 > > > >
61 > > > > MSI RTX 2060 SUPER
62 > > > > Ryzen 5 3600
63 > > > > 32GB RAM
64 > > > > MSI Tomahawk MAX
65 > > > >
66 > > > > Does everyone has the same problems probably already solved or any
67 > > > > idea how I can those freezes?
68 > > > >
69 > > > > Any help or idea what causes this freezes is very appreciated! :)
70 > > > >
71 > > > > Cheers!
72 > > > > Meino
73 > > > >
74 > > > >
75 > > > >
76 > > >
77 > > > Meino,
78 > > > Generically, you need to set up some sort of real-time monitoring
79 > > > and watch to see what is using CPU and/or I/O when the machine
80 > > > 'appears' to hang. I say 'appears' because the machine is probably
81 > > > running correctly but doing something other than Blender work.
82 > > >
83 > > > NOTE: You didn't say that there is or isn't any disk activity
84 > > > when this happens.
85 > > >
86 > > > When I look at this sort of problem I set up a second machine,
87 > > > ssh in with a bunch of terminals and start with 'top' and 'iotop' to
88 > > > watch for what process might be using resources. top watches CPU,
89 > > > iotop watches disk. Conceptually networking can lock up the machine
90 > > > but it's never happened to me.
91 > > >
92 > > > You can also look to see if some piece of hardware is generating
93 > > > too many interrupts. Do
94 > > >
95 > > > watch cat /proc/interrupts
96 > > >
97 > > > in a wide terminal when not running Blender to get used to what the
98 > > > machine does when idle, then run Blender and see if anything is
99 > > > going crazy generating interrupts.
100 > > >
101 > > > I hope some part of this helps you find your problem.
102 > > >
103 > > > Mark
104 > >
105 > >
106 > > Hi Mark,
107 > >
108 > > Thank you for your help and ideas.
109 > >
110 > > I will try to convince my tablet to connect to my PC and will try to
111 > > lock everyting which may point me into to the direction of the
112 > > problem.
113 > >
114 > > To complete, what I previously wrote:
115 > > No, the disk is not doing anything beside holidays in Honulu...so to
116 > > say ;)
117 > >
118 > > As soon I have something new "I'll be back"... :)
119 > >
120 > > Cheers!
121 > > Meino
122 > >
123 > >
124 >
125 > Hi,
126 >
127 > since three days these locks nags me. Now, equipped with a tablet, a ssh connection
128 > and an eye on the interrupts...
129 > ...no locks happen anymore.
130 >
131 > Grrrmpppfff...
132 >
133 > And let me guess: In the moment when I have the least of an application for
134 > those locks...then they will be back... :(
135 >
136 > Sorry, I cannot reproduce the problem anymore.
137 >
138 > Cheers!
139 > Meino
140 >
141 >
142 >

Attachments

File name MIME type
openpgp-digital-signature.asc application/pgp-signature