incubator-general mailing list archives

From "Noel J. Bergman"
Subject [VOTE] Nutch
Date Wed, 05 Jan 2005 07:08:38 GMT
Doug proposed this a week and change ago (bad timing to do such things
around major holidays :-)).  So far we have support from Dain, Nicola Ken,
Doug, Erik Hatcher, Henning, Roy and myself.

Roy, Nicola Ken, and myself provide the minimum 3 +1 from the PMC based upon
the current roster.

	--- Noel


(0) rationale

Nutch is web search software. It builds on the Apache Lucene search library,
adding a crawler, web database (including full link graph), plugins for
various document formats, user interface, etc. It is currently used by
several sites, including the Internet Archive.

Nutch is a two-year-old open source project, currently hosted at Sourceforge
and backed by its own non-profit organization. The non-profit was founded in
order to assign copyright, so that we could retain the right to change the
license. We have now determined that the Apache license is the appropriate
license for Nutch and no longer require the overhead of an independent
non-profit organization. Nutch's board of directors and its developers have
both been polled and support a move to the Apache foundation.

We anticipate that Nutch will join the recently proposed
top-level project, with Lucene and its various ports.

(0.1) criteria


Nutch's developers are already comfortable operating as a meritocracy.
Nutch's current developer policies are a bit more informal than those of
Apache, but, then, there have never been any notable conflicts to resolve.


Nutch has an established and active developer community.

Core Developers:

Nutch has four active committers who are experienced open source developers.


Nutch currently uses the following Apache projects: Ant, Lucene, Xerces,
POI, Commons.

(0.2) warning signs

Orphaned products:

Nutch is not an orphan. It has the same corporate sponsors that it has
always had.

Inexperience with open source:

Nutch's committers are experienced with open source.

Homogeneous developers:

Nutch's committers do not all share an employer or nation. All decisions are
made openly on public mailing lists.

Reliance on salaried developers:

Nutch has no salaried developers.

No ties to other Apache products:

Nutch has strong ties to Lucene.

A fascination with the Apache brand:

Nutch has a strong brand already. While the Apache brand will enhance that,
that is not a primary motivation for Nutch to join Apache.

(1) scope of the subprojects

All code is currently licensed under a variant of the Apache License 1.0.
The developers have approved a move to the Apache 2.0 license and a
re-assignment of copyright to the Apache Software Foundation. We have signed
Contributor License Agreements on file for all developers.

(3) identify the ASF resources to be created

(3.1) mailing list(s)

(3.2) Subversion or CVS repositories

(3.3) Jira

Nutch (NUTCH)

(4) identify the initial set of committers

Doug Cutting (Lucene committer)
Michael Cafarella (current Nutch committer at Sourceforge)
Andrzej Bialecki (current Nutch committer at Sourceforge)
John Xing (current Nutch committer at Sourceforge)
Sami Siren (current Nutch committer at Sourceforge)

(5) identify apache sponsoring individual

Erik Hatcher, Champion and Mentor
Doug Cutting, Mentor
