Gentoo Archives: gentoo-soc

From: Brian Dolbec <brian.dolbec@×××××.com>
To: gentoo-soc@l.g.o
Subject: Re: [gentoo-soc] GSoC - Package statistics
Date: Tue, 22 Mar 2011 00:52:15
Message-Id: 1300755118.31104.110.camel@big_daddy.dol-sen.ca
In Reply to: [gentoo-soc] GSoC - Package statistics by Michael Seifert
1 On Tue, 2011-03-22 at 00:39 +0100, Michael Seifert wrote:
2 > -----BEGIN PGP SIGNED MESSAGE-----
3 > Hash: SHA1
4 >
5 > Hello Gentoo team,
6 >
7 > I am interested in taking part in this year's Google Summer of Code and
8 > would like to discuss some general things. More specific, I am talking
9 > about Christopher Harvey's idea to design an application that raises
10 > statistics on the installed packages on a system (see [1]).
11 >
12 > Let me explain my issue:
13 > In the first place, I am a Java developer and I consider my Java skills
14 > to be quite well. The point is that java is not part of a minimal gentoo
15 > system, whereas Python is. I would not have problems with Python, but I
16 > would have to learn the API of a GUI toolkit (preferably PyQt).
17 > What is your opinion: Would you be okay with a Java application or would
18 > you rather like to have a Python based one or even some other language?
19 > And finally, how much time do you think will it cost to work through the
20 > Qt API?
21 >
22 >
23
24 Well, a good deal of the info gathering methods are already coded in
25 python (by me) and installed on many users systems already, soon to be
26 even more as gentoolkit-0.3.0 final is about to be released.
27
28 Have a look at the analyse module. It is designed for accessing the
29 installed package database and reporting/repairing things about it.
30 There is room for many more types of reports depending what is needed.
31 Adding an anonymous data upload should not be difficult. So far I have
32 only been adding report types for problems users have run into. It has
33 USE flags and keywords reports so far. It has an api to use so as to
34 not need terminal output parsing to get the relevant data out of it.
35 Also so does most of equery and eclean have usable api's now (if there
36 is anything needed from there (the reason I got involved in gentoolkit
37 coding).
38
39 I don't think a gui would be needed, but rather webapp interfaces to the
40 data gathered. Most of the devs are cli die hards, so a simple command
41 line interface to query the central database should be primary. I
42 believe it would be more widely used for the ebuilds they maintain. A
43 browser could be used to connect to it and get graphs, etc. for more
44 elaborate info displays.
45
46 I would think the main portion of this project would be the database and
47 webapp query tools. Potential controversy aside, I would also think you
48 should add a mechanism for a dev to trigger a simple query for people
49 having a cat/pkg-ver installed to optionally fill in a few questions
50 regarding an pkg's stability,... all anonymously. Those queries could
51 be generic in nature and come with the data gathering tool, that way
52 only a simple small string need be downloaded at the time of data upload
53 (for ex: pkgname, query #, possibly a bug #) no executable code.
54
55 So I would concentrate your research and proposal efforts on the
56 database and webapps.
57 --
58 Brian Dolbec <brian.dolbec@×××××.com>

Attachments

File name MIME type
signature.asc application/pgp-signature

Replies

Subject Author
Re: [gentoo-soc] GSoC - Package statistics chris@××××××××××××.com