Gentoo Archives: gentoo-soc

From: Yanbo Wang <wybb86@×××××.com>
To: gentoo-soc@l.g.o
Subject: Re: [gentoo-soc] Package statistics reporting tool
Date: Tue, 29 Mar 2011 19:01:36
Message-Id: 4D922C6F.50700@gmail.com
In Reply to: Re: [gentoo-soc] Package statistics reporting tool by Brian Dolbec
1 Hi all, I'm a student from China. I'm interested in this idea and have
2 different ideas to share.
3
4 On 2011-3-29 22:20, Brian Dolbec wrote:
5 > On Tue, 2011-03-29 at 08:53 -0500, Donnie Berkholz wrote:
6 >> On 18:54 Tue 29 Mar , Vikraman wrote:
7 >>> I've been doing research on this project for some time, and would
8 >>> really like my ideas to be reviewed by the community before
9 >>> submitting my application.
10 >>>
11 >>> * User runs a script as a cronjob (or a portage hook ?), which
12 >>> collects stats on installed packages, and uploads to the online
13 >>> database. Surprisingly easy to code using the gentoolkit API.
14 > :) Thanks, anything else needed in there, just ask.
15 >
16 >> Sure, one of those two options sounds good. The choice can be easily
17 >> changed later, but I prefer the idea of a hook that runs after the
18 >> package manager completes installing a package set.
19 >>
20 >> People on laptops may not run cron jobs regularly, or at all.
21 > good point.
22 >
23 >>> * Webapp generates statistics on the packages (top packages,
24 >>> versions, popularity, packages from overlays). Thanks to ideas from
25 >>> an old thread on the forums. Devs, please suggest some more useful
26 >>> ideas/stats too.
27 >> For Gentoo users:
28 >> - Search by package name
29 >> - Search by packages often installed with this one (recommendations)
30 >> - Add ratings. Maybe just via the GUI frontend for now.
31 >> - Add user reputations and profiles. How many ratings, etc. Provide the
32 >> option of logging in as user or anonymously for package uploads.
33 >>
34 >>
35 I think statistics about the relationships between packages are more
36 important. If packages that are installed together run well and
37 appear together frequently, we can recommend one package to the user
38 when the other is installed. If packages that are installed
39 together often crash, there may be a conflict between them.
40 So deeper kinds of statistics could provide more useful
41 information.
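
To make the co-installation idea concrete, here is a minimal sketch in Python. It assumes the server already holds one package list per host; the function names and the sample data are made up for illustration and are not part of any existing tool. It only covers the "appear together frequently" half, since crash reports would need extra data that nobody has specified yet.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(hosts):
    """Count how many hosts have each unordered pair of packages installed."""
    pairs = Counter()
    for packages in hosts:
        # Sort so that each pair is always stored as (smaller, larger).
        for a, b in combinations(sorted(set(packages)), 2):
            pairs[(a, b)] += 1
    return pairs


def recommend(package, hosts, top_n=3):
    """Return the packages most often installed alongside `package`."""
    scores = Counter()
    for (a, b), count in cooccurrence_counts(hosts).items():
        if a == package:
            scores[b] += count
        elif b == package:
            scores[a] += count
    # Highest count first; ties are broken by name so the order is stable.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [name for name, _ in ranked[:top_n]]


# Made-up sample data: each inner list is one host's installed packages.
SAMPLE_HOSTS = [
    ["app-editors/vim", "dev-vcs/git", "dev-lang/python"],
    ["app-editors/vim", "dev-vcs/git"],
    ["dev-vcs/git", "dev-lang/python"],
    ["app-editors/vim", "dev-vcs/git", "app-misc/screen"],
]

if __name__ == "__main__":
    print(recommend("app-editors/vim", SAMPLE_HOSTS))
```

Raw counts favour packages that almost everyone installs, so a real implementation would probably normalise by each package's overall popularity.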
42 >> Tools for Gentoo developers:
43 >> - Least popular packages
44 >> - Search by maintainer/herd
45 >> - Integrate with Bugzilla by showing # of bugs & linking to the list
46 >> - Integrate w/ BZ by showing # of tinderbox-related bugs
47 >>
48 >> For example, a useful search would be ≤20 users, package assigned to
49 >> maintainer-needed or maintainer wanted, and ≥1 open bugs.
50 >>
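
The example search above could look roughly like this. It is only a sketch: the `PackageStats` record, its field names, and the sample rows are assumptions for illustration, and in practice the user counts would come from the stats database and the bug counts from Bugzilla.

```python
from dataclasses import dataclass


@dataclass
class PackageStats:
    """Hypothetical per-package record combining stats and bug data."""
    name: str
    users: int        # number of hosts reporting this package
    maintainer: str   # assigned maintainer or herd
    open_bugs: int    # open bugs for this package


# Assignees that mean "nobody maintains this package".
UNMAINTAINED = {"maintainer-needed", "maintainer-wanted"}


def needs_attention(stats, max_users=20, min_bugs=1):
    """Return names of rarely used, unmaintained packages with open bugs."""
    return [
        s.name
        for s in stats
        if s.users <= max_users
        and s.maintainer in UNMAINTAINED
        and s.open_bugs >= min_bugs
    ]


# Made-up sample rows; package names are placeholders.
SAMPLE_STATS = [
    PackageStats("cat-a/rare-broken", users=5, maintainer="maintainer-needed", open_bugs=2),
    PackageStats("cat-a/rare-clean", users=5, maintainer="maintainer-needed", open_bugs=0),
    PackageStats("cat-b/popular", users=900, maintainer="maintainer-wanted", open_bugs=4),
    PackageStats("cat-c/maintained", users=10, maintainer="some-herd", open_bugs=3),
]

if __name__ == "__main__":
    print(needs_attention(SAMPLE_STATS))
```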
51 >>> * Dev/user runs a cli script to read stats or a gui program to
52 >>> display graphs.
53 >>>
54 >>> I'm looking forward to coding the project in Python, gui using
55 >>> PyGTK, database using MySQL. For the web service implementation,
56 >>> should I use SOAP (soappy, which I'll need to learn) or something
57 >>> like web.py (I've used it earlier) ?
58 >> What is the role of the GUI? Please describe the use cases and justify
59 >> why it should be a desktop app rather than a webapp.
60 > As much of a gui person as I am, I too think a web browser would be
61 > preferred connecting for this. It could also be expanded as the need or
62 > new query type comes up and then becomes available to all without the
63 > need to upgrade.
64 >
65 >> Consider REST for the web service rather than SOAP. There's a nice book
66 >> called "RESTful Web Services" you might like to take a look at.
67 >>
68 >>> It was also suggested earlier to allow a dev to trigger a popup
69 >>> feedback from the user. I think that would require the user to run a
70 >>> daemon, which will listen for queries.
71 >> It doesn't necessarily have to be instant. Upon the next installation,
72 >> when uploading new stats, the client could check for messages.
73 > Yes, this was what I was trying to suggest at the time, perhaps I could
74 > have chosen better words :/
75 >
76 >>> Please suggest what details to include and what not to include in
77 >>> the application.
78 >> Many resources exist on this topic:
79 >>
80 >> - http://www.gentoo.org/proj/en/userrel/soc/applying.xml
81 >> - http://www.booki.cc/gsocstudentguide/_v/1.0/writing-a-proposal/
82 >> - http://google-opensource.blogspot.com/2011/03/dos-and-donts-of-google-summer-of-code.html
83 >>
84 >
85 Information about installed packages is uploaded to the online database
86 regularly, but the uploads are mostly duplicates because most of the data
87 stays the same. I think each host can be identified by a unique random ID
88 that is used to track submissions from the same host. Security is very
89 important for this project, so the host ID should be kept secret.
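
A rough sketch of how the host ID and duplicate detection might fit together, in Python. Everything here is an assumption for illustration: `StatsStore` is an in-memory stand-in for the online database, and the idea of comparing a hash of the package list is just one possible way to skip unchanged uploads.

```python
import hashlib
import json
import uuid


def new_host_id():
    """Generate a random 128-bit ID that carries no hardware or user info."""
    return uuid.uuid4().hex


def snapshot_digest(packages):
    """Hash the sorted package list so identical snapshots compare equal."""
    canonical = json.dumps(sorted(set(packages))).encode("utf-8")
    return hashlib.sha256(canonical).hexdigest()


class StatsStore:
    """In-memory stand-in for the online database (hypothetical)."""

    def __init__(self):
        self.latest = {}   # host ID -> digest of the last accepted snapshot
        self.accepted = 0  # number of snapshots actually stored

    def submit(self, host_id, packages):
        """Store a snapshot unless it matches the host's previous one."""
        digest = snapshot_digest(packages)
        if self.latest.get(host_id) == digest:
            return False  # nothing changed since the last upload
        self.latest[host_id] = digest
        self.accepted += 1
        return True


if __name__ == "__main__":
    store = StatsStore()
    host = new_host_id()  # the client would generate this once and keep it
    print(store.submit(host, ["dev-vcs/git", "app-editors/vim"]))  # stored
    print(store.submit(host, ["app-editors/vim", "dev-vcs/git"]))  # duplicate
```

The client could also compute the digest locally and skip the upload entirely when nothing has changed. To keep the ID secret, it would need to be stored in a file only root can read and sent only over an encrypted connection.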
90
91 --
92 Thanks,
93 Yanbo Wang
