Linux: Measuring Kernel Commits

Submitted by Jeremy
on October 14, 2003 - 6:41am

BitKeeper creator Larry McVoy [interview] recently commented on the sheer quantity of work going on in the Linux kernel. He summarizes:

"What that means is that in about a year, you've managed to create 65,337 changesets. That's 179 per day, 7.4/hour, 24x7. You guys are busy. To put that in perspective, the most active project on sourceforge today, Gaim, has 805 commits to its changelog. Over 3.5 years. That means you are changing your source base 284x more often than they are. And that's just the BK users, that doesn't count the people not using BK, which are a substantial fraction."

Larry posted similar statistics a little over a year ago [story] which also showed a tremendous rate of activity. Read on for the full email.


From: Larry McVoy [email blocked]
To: linux-kernel [email blocked]
Subject: Silly BK statistics
Date: Mon, 13 Oct 2003 19:55:51 -0700

You guys work way too hard.  The BK openlogging tree, which has all
changesets ever made by anyone in the Linux kernel, was getting big.
Really big.  The nodes in the graph have internal "serial numbers"
which are currently 16 bits, i.e., there can't be more than 64K nodes
in the graph.

I sent mail to some of my engineers this morning saying "hey, I suspect
the Linux openlogging tree is about overflow, we need to go to 32 bit
ser_t's."  I had no idea how close we were, I just knew it was a problem
we needed to solve.

I just got mail from one of the team which reads: "With 199 serials shy
of overflowing , the 32 bit version is now installed".

What that means is that in about a year, you've managed to create 65,337
changesets.  That's 179 per day, 7.4/hour, 24x7.  You guys are busy.

To put that in perspective, the most active project on sourceforge today,
Gaim, has 805 commits to its changelog.  Over 3.5 years.  That means you
are changing your source base 284x more often than they are.  And that's 
just the BK users, that doesn't count the people not using BK, which are
a substantial fraction.

No matter how you slice it, it is pretty amazing rate of change.
If change is good, you guys rock, I've never seen anything like it.
-- 
---
Larry McVoy              lm at bitmover.com          http://www.bitmover.com/lm

Related Links:

Measuring commits to the chan

schneelocke
on
October 14, 2003 - 11:56am

Measuring commits to the changelog may not do Gaim justice. The sourceforge project page at http://sf.net/projects/gaim says, for example:

" CVS Repository ( 13,529 commits, 3,435 adds )"

--
schnee

Directly comparable?

molo
on
October 14, 2003 - 12:41pm

I'm not sure that is directly comparable. CVS commits and adds are per-file. BK changesets (I believe) are cross-file.

-molo

Yes, it is not directly compa

schneelocke
on
October 14, 2003 - 1:45pm

Yes, it is not directly comparable. But then, changelog commits and kitbeeper changesets aren't, either; and, for that matter, Gaim is not even the most active project on sourceforge, anyway, if you measure by CVS commits etc. (for example, the "compiere" project has: "CVS Repository ( 28,022 commits, 4,476 adds )").

FWIW, IIRC, the "most active" list on Sourceforge is based on pageviews and downloads rather than project CVS activity, anyway.

--
schnee

Impressive

Anonymous
on
October 14, 2003 - 12:03pm

It's impressive they can do that while still keeping things in relatively working condition.

It would be very interesting

Anonymous
on
October 14, 2003 - 12:15pm

It would be very interesting to compare theses numbers with some big open source project, such as Mozilla. Gaim has much less source code, and is much less used than the Linux kernel, so it's not the same category.

KDE numbers

Anonymous
on
October 18, 2003 - 5:22pm

Each week there are an average of 2000 commits. That is multifile atomic commits, not counting each file change.

Again, not changesets. I believe that changesets are usually a bit larger. Size of each commit ranges from one liner bug fixes to major feature implementations. The total also includes i18n translations. And all the applications that are in the KDE repository, ie. Koffice, etc.

Still, an impressive amount of work.

Derek
KDE-CVS-Digest

graph

andersg
on
October 14, 2003 - 1:23pm

Btw. I graphed this with rrdtool the other day:

http://people.0x63.nu/~andersg/lk-csets.png

linux on CIA

Anonymous
on
October 15, 2003 - 10:15pm

The CIA bot gets real-time commit notification from both linux and gaim, so you can compare them on its stats page.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.