Gentoo Archives: gentoo-user

From: David Haller <gentoo@×××××××.de>
To: gentoo-user@l.g.o
Subject: Re: [gentoo-user] media-sound/podracer doesn't name downloaded podcasts
Date: Sun, 25 Feb 2018 22:35:14
Message-Id: 20180225223454.hfbxsjuxmyprrqek@grusum.endjinn.de
In Reply to: Re: [gentoo-user] media-sound/podracer doesn't name downloaded podcasts by Stroller
1 Hello,
2
3 On Sun, 25 Feb 2018, Stroller wrote:
4 >> On 25 Feb 2018, at 21:00, David Haller <gentoo@×××××××.de> wrote:
5 >> ...
6 >>> I would have assumed the podcast feed (RSS or whatever?) would
7 >>> contain both the link to the episode, with a filename like this, and
8 >>> also a human readable name, such as "Episode #566 - The Zoo Economy".
9 >>
10 >> $ youtube-dl --download-archive .yt-dl-archive -f mpeg 'https://www.npr.org/rss/podcast.php?id=510289'
11 [..]
12 >> $
13 >>
14 >> The --download-archive .yt-dl-archive records the already-downloaded
15 >> media (their source (npr) and ids) in the file .yt-dl-archive in the
16 >> current directory (adjust path to your liking...)
17 >
18 >That's miles better, thanks.
19
20 I forgot: I had to add the '-f mpeg' option+arg, which normally is not
21 neccessary with youtube-dl, but I got this error,
22 (probably because I specified some formats in the config):
23
24 $ yt-dl 'https://www.npr.org/rss/podcast.php?id=510289'
25 [generic] podcast: Requesting header
26 WARNING: Falling back on generic information extractor.
27 [generic] podcast: Downloading webpage
28 [generic] podcast: Extracting information
29 [download] Downloading playlist: Planet Money
30 [generic] playlist Planet Money: Collected 300 video ids (downloading 300 of them)
31 [download] Downloading video 1 of 300
32 [generic] 20180223_pmoney_pmpod826: Requesting header
33 [redirect] Following redirect to https://16543.mc.tritondigital.com:443/NPR_510289/media-session/6e22c004-4ae9-44ef-a8f3-fb8b7d1175fd/anon.npr-mp3/npr/pmoney/2018/02/20180223_pmoney_pmpod826.mp3?orgId=1&d=1227&p=510289&story=588345420&t=podcast&e=588345420&ft=pod&f=510289
34 [generic] 20180223_pmoney_pmpod826: Requesting header
35 ERROR: requested format not available
36
37 So, I ran it with '-F' to check available formats:
38
39 $ yt-dl -F 'https://www.npr.org/rss/podcast.php?id=510289'
40 [generic] podcast: Requesting header
41 WARNING: Falling back on generic information extractor.
42 [generic] podcast: Downloading webpage
43 [generic] podcast: Extracting information
44 [download] Downloading playlist: Planet Money
45 [generic] playlist Planet Money: Collected 300 video ids (downloading 300 of them)
46 [download] Downloading video 1 of 300
47 [generic] 20180223_pmoney_pmpod826: Requesting header
48 [redirect] Following redirect to https://16543.mc.tritondigital.com:443/NPR_510289/media-session/0f30aa0c-1995-432c-b33a-f431b41e2155/anon.npr-mp3/npr/pmoney/2018/02/20180223_pmoney_pmpod826.mp3?orgId=1&d=1227&p=510289&story=588345420&t=podcast&e=588345420&ft=pod&f=510289
49 [generic] 20180223_pmoney_pmpod826: Requesting header
50 [info] Available formats for 20180223_pmoney_pmpod826:
51 format code extension resolution note
52 mpeg mp3 audio only
53 [download] Downloading video 2 of 300
54 ^C
55 ERROR: Interrupted by user
56
57 Relevant:
58
59 [info] Available formats for 20180223_pmoney_pmpod826:
60 format code extension resolution note
61 mpeg mp3 audio only
62
63 So, there I went with '-f mpeg' ;)
64
65 >> And 'https://www.npr.org/rss/podcast.php?id=510289' is the url you get
66 >> if you subscribe to the podcast.
67 >
68 >Where did you find this, please? On NPR's site? I seem to be finding loads of different URLs for it on there.
69
70 I went to: https://www.npr.org/sections/money/ clicked on "subscribe
71 to podcast" at the top under the section-logo which took me to
72 https://www.npr.org/sections/money/ and because I'm "paranoid", I
73 looked at the sourcecode of the big blue "Subscribe" button and found
74 this:
75
76 ====
77 <li class="subscribe" data-sharepop="" data-selector=".sharepop.subscribe">
78 <button data-metrics="{&quot;category&quot;:&quot;podcast directory&quot;,&quot;action&quot;:&quot;toggle subscribe menu&quot;}">subscribe<b> to podcast</b></button>
79 </li>
80 <div class="sharepop-four">
81 [..]
82 <li>
83 <a target="_blank" href="https://www.npr.org/templates/rss/podcast.php?id=510289" data-metrics="...">RSS link</a></li>
84 </ul>
85 </div>
86 </div>
87 ====
88
89 Ok, I activated JS now and clicked on the button, which is a drop-down
90 and the last is the above "RSS link" and clicking on it took me to the
91 rss feed https://www.npr.org/rss/podcast.php?id=510289, same as
92 opening the url from the source-code.
93
94 Seamonkey (and Thunderbird?) shows a "subscribe" feed UI, but looking
95 at the "source" of that, I get a pretty normal RSS file listing the
96 info and URLs to the mp3 streams.
97
98 Parsing the rss yourself would not be that hard either ;) But all the
99 bookeeping of what you already have, handling renaming, etc...
100
101 HTH,
102 -dnh
103
104 --
105 Machine Always Crashes, If Not, The Operating System Hangs (MACINTOSH)
106 -- Topic on #Linux