Gentoo Archives: gentoo-dev

From: "Michał Górny" <mgorny@g.o>
To: gentoo-dev@l.g.o
Subject: [gentoo-dev] Stats: Gentoo developer commit timeline
Date: Mon, 16 Jan 2017 09:35:47
Hi, everyone.

Just a quick side project we've done a while ago. It's a timeline of
developer commit activity [1]. Code for data processing in [2]. I did
the data, Amynka prepared a nice JS to graph it.

This version is running purely on git + historical cvs->git conversion.
Dates are done using committer timestamp (so they may shift a little
when people fail to set times), authors are mapped by author +
committer emails. This means that two people can get one commit
attributed -- since the goal is only to check activity, not attribute

The emails are further mapped to developer accounts either via LDAP
(mail + gentooAlias), or via stripping ''. People outside
Gentoo are skipped since it's hard to track their email changes.

The graphs are interrupted if a developer had no commits for 90 days.
This is meant to visualize returning developers better -- though I
guess it more often points out developers who rarely commit at all ;-).
Possibly it also emphasizes the cases when developers slowly lost
interest in Gentoo.

Of course, it's all approximate. Non-commit activity is not included.
Some of more recent developers are listed 'earlier' due to their
proxy-maint activity. Some of the data may be inconsistent, some of
proxy-maint work may be missing (if we don't have mapping from old
e-mail address), some developer may be duplicated (if we don't have
mapping for their username changes).

To avoid any disputable proxy-maint data, I could go for committer
addresses only + --first-parent to skip over merge commits. However, I
know that a few developers failed at merge commits in the past, so that
would also skip some legitimate data.


Best regards,
Michał Górny