incubator-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Pierre Smits <pierresm...@apache.org>
Subject Re: [VOTE] Accept Pinot into Apache Incubator
Date Sun, 11 Mar 2018 11:51:52 GMT
+1



Best regards,

Pierre Smits

V.P. Apache Trafodion

On Sun, Mar 11, 2018 at 12:59 AM, Julian Hyde <jhyde@apache.org> wrote:

> +1 (binding)
>
> Ironic that Druid — a similar project — has just entered incubation too.
> But of course that is not a conflict. Both are great projects. Good luck!
>
> Julian
>
>
> > On Mar 9, 2018, at 7:37 PM, Carl Steinbach <cws@apache.org> wrote:
> >
> > +1 (binding)
> >
> > On Fri, Mar 9, 2018, 7:29 PM kishore g <g.kishore@gmail.com> wrote:
> >
> >> Added Jim Jagielski to the mentor's list.
> >>
> >> On Fri, Mar 9, 2018 at 6:35 PM, Olivier Lamy <olamy@apache.org> wrote:
> >>
> >>> +1
> >>>
> >>> On 9 March 2018 at 17:11, kishore g <g.kishore@gmail.com> wrote:
> >>>
> >>>> Hi all,
> >>>>
> >>>> I would like to call a VOTE to accept Pinot into the Apache Incubator.
> >>> The
> >>>> full proposal is available on the wiki
> >>>> <https://wiki.apache.org/incubator/PinotProposal>
> >>>>
> >>>> Please cast your vote:
> >>>>
> >>>>  [ ] +1, bring Pinot into Incubator
> >>>>  [ ] +0, I don't care either way,
> >>>>  [ ] -1, do not bring Pinot into Incubator, because...
> >>>>
> >>>> The vote will open at least for 72 hours and only votes from the
> >>> Incubator
> >>>> PMC are binding.
> >>>>
> >>>> Thanks,
> >>>> Kishore G
> >>>>
> >>>> Discussion thread:
> >>>> https://lists.apache.org/thread.html/8119f9478ea1811371f1bf6685290b
> >>>> 22b57b1a3e0849d1d778d77dcb@%3Cgeneral.incubator.apache.org
> >>>>
> >>>>
> >>>> = Pinot Proposal =
> >>>>
> >>>> == Abstract ==
> >>>>
> >>>> Pinot is a distributed columnar storage engine that can ingest data
in
> >>>> real-time and serve analytical queries at low latency. There are two
> >>> modes
> >>>> of data ingestion - batch and/or realtime. Batch mode allows users to
> >>>> generate pinot segments externally using systems such as Hadoop. These
> >>>> segments can be uploaded into Pinot via simple curl calls. Pinot can
> >>> ingest
> >>>> data in near real-time from streaming sources such as Kafka. Data
> >>> ingested
> >>>> into Pinot is stored in a columnar format. Pinot provides a SQL like
> >>>> interface (PQL) that supports filters, aggregations, and group by
> >>>> operations. It does not support joins by design, in order to guarantee
> >>>> predictable latency. It leverages other Apache projects such as
> >>> Zookeeper,
> >>>> Kafka, and Helix, along with many libraries from the ASF.
> >>>>
> >>>> == Proposal ==
> >>>>
> >>>> Pinot was open sourced by LinkedIn and hosted on GitHub. Majority of
> >> the
> >>>> development happens at LinkedIn with other contributions from Uber and
> >>>> Slack. We believe that being a part of Apache Software Foundation will
> >>>> improve the diversity and help form a strong community around the
> >>> project.
> >>>>
> >>>> LinkedIn submits this proposal to donate the code base to Apache
> >> Software
> >>>> Foundation. The code is already under Apache License 2.0.  Code and
> the
> >>>> documentation are hosted on Github.
> >>>> * Code: http://github.com/linkedin/pinot
> >>>> * Documentation: https://github.com/linkedin/pinot/wiki
> >>>>
> >>>>
> >>>> == Background ==
> >>>>
> >>>> LinkedIn, similar to other companies, has many applications that
> >> provide
> >>>> rich real-time insights to members and customers (internal and
> >> external).
> >>>> The workload characteristics for these applications vary a lot. Some
> >>>> internal applications simply need ad-hoc query capabilities with
> >>> sub-second
> >>>> to multiple seconds latency. But external site facing applications
> >>> require
> >>>> strong SLA even very high workloads. Prior to Pinot, LinkedIn had
> >>> multiple
> >>>> solutions depending on the workload generated by the application and
> >> this
> >>>> was inefficient. Pinot was developed to be the one single platform
> that
> >>>> addresses all classes of applications. Today at LinkedIn, Pinot powers
> >>> more
> >>>> than 50 site facing products with workload ranging from few queries
> per
> >>>> second to 1000’s of queries per second while maintaining the 99th
> >>>> percentile latency which can be as low as few milliseconds. All
> >> internal
> >>>> dashboards at LinkedIn are powered by Pinot.
> >>>>
> >>>> == Rationale ==
> >>>>
> >>>> We believe that requirement to develop rich real-time analytic
> >>> applications
> >>>> is applicable to other organizations. Both Pinot and the interested
> >>>> communities would benefit from this work being openly available.
> >>>>
> >>>> == Current Status ==
> >>>>
> >>>> Pinot is currently open sourced under the Apache License Version 2.0
> >> and
> >>>> available at github.com/linkedin/pinot. All the development is done
> >>> using
> >>>> GitHub Pull Requests. We cut releases on a weekly basis and deploy it
> >> at
> >>>> LinkedIn. mp-0.1.468 is the latest release tag that is deployed in
> >>>> production.
> >>>>
> >>>> == Meritocracy ==
> >>>>
> >>>> Following the Apache meritocracy model, we intend to build an open and
> >>>> diverse community around Pinot. We will encourage the community to
> >>>> contribute to discussion and codebase.
> >>>>
> >>>> == Community ==
> >>>>
> >>>> Pinot is currently used extensively at LinkedIn and Uber. Several
> >>> companies
> >>>> have expressed interest in the project. We hope to extend the
> >> contributor
> >>>> base significantly by bringing Pinot into Apache.
> >>>>
> >>>> == Core Developers ==
> >>>>
> >>>> Pinot was started by engineers at LinkedIn, and now has committers
> from
> >>>> Uber.
> >>>>
> >>>> == Alignment ==
> >>>>
> >>>> Apache is the most natural home for taking Pinot forward. Pinot
> >> leverages
> >>>> several existing Apache Projects such as Kafka, Helix, Zookeeper, and
> >>> Avro.
> >>>> As Pinot gains adoption, we plan to add support for the ORC and
> Parquet
> >>>> formats, as well as adding integration with Yarn and Mesos.
> >>>>
> >>>> == Known Risks ==
> >>>>
> >>>> === Orphaned Products ===
> >>>>
> >>>> The risk of the Pinot project being abandoned is minimal. The teams
at
> >>>> LinkedIn and Uber are highly incentivized to continue development of
> >>> Pinot
> >>>> as it is a critical part of their infrastructure.
> >>>>
> >>>> === Inexperience with Open Source ===
> >>>>
> >>>> Post open sourcing, Pinot was completely developed on GitHub. All the
> >>>> current developers on Pinot are well aware of the open source
> >> development
> >>>> process. However, most of the developers are new to the Apache
> process.
> >>>> Kishore Gopalakrishna, one of the lead developers in Pinot, is VP and
> >>>> committer of the Apache Helix project.
> >>>>
> >>>> === Homogenous Developers ===
> >>>>
> >>>> The current core developers are all from LinkedIn and Uber. However,
> we
> >>>> hope to establish a developer community that includes contributors
> from
> >>>> several corporations and we are actively encouraging new contributors
> >> via
> >>>> the mailing lists and public presentations of Pinot.
> >>>>
> >>>> === Reliance on Salaried Developers ===
> >>>>
> >>>> It is expected that Pinot development will occur on both salaried time
> >>> and
> >>>> on volunteer time, after hours. The majority of initial committers are
> >>> paid
> >>>> by their employer to contribute to this project. However, they are all
> >>>> passionate about the project, and we are confident that the project
> >> will
> >>>> continue even if no salaried developers contribute to the project. We
> >> are
> >>>> committed to recruiting additional committers including non-salaried
> >>>> developers.
> >>>>
> >>>> === Relationships with Other Apache Products ===
> >>>>
> >>>> As mentioned earlier, Pinot uses several Apache Projects such as Kafka
> >> to
> >>>> ingest data in real-time, Zookeeper and Helix for cluster management.
> >>> Pinot
> >>>> also uses Maven for build and release. We foresee adding support for
> >> the
> >>>> Parquet and ORC formats. Adding the ability to deploy on Yarn and
> Mesos
> >>>> clusters is another interesting project we might pursue.
> >>>>
> >>>> === An Excessive Fascination with the Apache Brand ===
> >>>>
> >>>> While we respect the reputation of the Apache brand and have no doubts
> >>> that
> >>>> it will attract contributors and users, we believe ASF is the right
> >> home
> >>>> for Pinot to foster a great community that will lead to a better
> >> outcome
> >>> in
> >>>> the long term.
> >>>>
> >>>> == Documentation ==
> >>>>
> >>>> * Code: https://github.com/linkedin/pinot/
> >>>> * Documentation: https://github.com/linkedin/pinot/wiki
> >>>> * User group: https://groups.google.com/forum/#!forum/pinot_users
> >>>>
> >>>> == Initial Source ==
> >>>>
> >>>> The current Pinot codebase is hosted on Github and licensed under the
> >>>> Apache License V2. The source tree is self contained and relies on
> >> Maven
> >>> as
> >>>> its build and dependency resolution mechanism.
> >>>>
> >>>> == External Dependencies ==
> >>>>
> >>>> All dependencies in Pinot have licenses that are compatible with
> Apache
> >>>> License V2, except for the org.json library, which will be removed
> >> prior
> >>> to
> >>>> Apache incubation. The list below summarizes the external dependencies
> >> of
> >>>> Pinot grouped by license and ASF license category.
> >>>>
> >>>> Dependencies from the ASF Category A
> >>>> === Apache License 2.0 ===
> >>>> * com.101tec:zkclient:0.7
> >>>> * com.alibaba:fastjson:1.1.24
> >>>> * com.clearspring.analytics:stream:2.7.0
> >>>> * com.fasterxml.jackson.core:jackson-annotations:2.8.0
> >>>> * com.fasterxml.jackson.core:jackson-core:2.8.0
> >>>> * com.fasterxml.jackson.core:jackson-databind:2.8.0
> >>>> * com.google.code.findbugs:jsr305:3.0.0
> >>>> * com.google.guava:guava:19
> >>>> * com.ning:async-http-client:1.9.21
> >>>> * com.yammer.metrics:metrics-core:2.2.0
> >>>> * commons-beanutils:commons-beanutils:1.8.3
> >>>> * commons-cli:commons-cli:1.2
> >>>> * commons-codec:commons-codec:1.6
> >>>> * commons-configuration:commons-configuration:1.6
> >>>> * commons-fileupload:commons-fileupload:1.2.2
> >>>> * commons-httpclient:commons-httpclient:3.1
> >>>> * commons-io:commons-io:2.1
> >>>> * commons-validator:commons-validator:1.4.0
> >>>> * io.netty:netty-all:4.1.4.Final
> >>>> * io.swagger:swagger-jaxrs:1.5.10
> >>>> * io.swagger:swagger-jersey2-jaxrs:1.5.10
> >>>> * it.unimi.dsi:fastutil:6.5.16
> >>>> * joda-time:joda-time:2
> >>>> * log4j:log4j:1.2.17
> >>>> * me.lemire.integercompression:JavaFastPFOR:0.0.13
> >>>> * nl.jqno.equalsverifier:equalsverifier:1.7.2
> >>>> * org.apache.avro:avro:1.7.6
> >>>> * org.apache.commons:commons-compress:1.9
> >>>> * org.apache.commons:commons-lang3:3.5
> >>>> * org.apache.commons:commons-math:2.1
> >>>> * org.apache.hadoop:hadoop-client:2.7.0
> >>>> * org.apache.hadoop:hadoop-common:2.7.0
> >>>> * org.apache.helix:helix-core:0.6.8
> >>>> * org.apache.httpcomponents:httpclient:4.1.3
> >>>> * org.apache.httpcomponents:httpclient:4.2.5
> >>>> * org.apache.httpcomponents:httpcore:4.2.5
> >>>> * org.apache.httpcomponents:httpmime:4.2.5
> >>>> * org.apache.kafka:kafka_2.10:0.9.0.1
> >>>> * org.apache.thrift:libthrift:0.9.1
> >>>> * org.apache.zookeeper:zookeeper:3.4.9
> >>>> * org.codehaus.jackson:jackson-core-asl:1.9.6
> >>>> * org.codehaus.jackson:jackson-mapper-asl:1.9.6
> >>>> * org.json:json:20080701
> >>>> * org.roaringbitmap:RoaringBitmap:0.5.10
> >>>> * org.testng:testng:6.0.1
> >>>> * org.twitter4j:twitter4j-core:4.0.3
> >>>> * org.webjars:swagger-ui:2.2.2
> >>>> * org.xerial.larray:larray:0.2.1
> >>>> * org.yaml:snakeyaml:1.16
> >>>> * xml-apis:xml-apis:1.0.b2
> >>>> === Dual license (Apache License 2.0 + LGPL 2.1), using under the
> >> Apache
> >>>> License ===
> >>>> * org.codehaus.jackson:jackson-jaxrs:1.9.6
> >>>> * org.codehaus.jackson:jackson-xc:1.9.6
> >>>> === BSD ===
> >>>> * com.jcabi:jcabi-log:0.17.1
> >>>> * org.antlr:antlr4-annotations:4.3
> >>>> * org.antlr:antlr4-runtime:4.3
> >>>> === MIT ===
> >>>> * com.github.nkzawa:socket.io-client:0.5.1
> >>>> * org.mockito:mockito-core:2.10.0
> >>>> * org.slf4j:slf4j-api:1.7.7
> >>>> * org.slf4j:slf4j-log4j12:1.7.7
> >>>>
> >>>> === Dependencies from the ASF Category B ===
> >>>> Dual license (CDDL 1.1 + GPL 2 w/ CPE), using under the CDDL
> >>>> * com.sun.jersey:jersey-client:1.19.2
> >>>> * javax.servlet:javax.servlet-api:3.0.1
> >>>> * org.glassfish.jersey.containers:jersey-container-grizzly2-http:2.23
> >>>> * org.glassfish.jersey.core:jersey-common:2.23
> >>>> * org.glassfish.jersey.core:jersey-server:2.23
> >>>> * org.glassfish.jersey.media:jersey-media-json-jackson:2.24
> >>>> * org.glassfish.jersey.media:jersey-media-multipart:2.23
> >>>>
> >>>> === Dependencies from the ASF Category X ===
> >>>> JSON License
> >>>> * org.json:json:20080701 (to be removed before Apache incubation)
> >>>>
> >>>>
> >>>> == Cryptography ==
> >>>>
> >>>> None
> >>>>
> >>>> == Required Resources ==
> >>>>
> >>>> === Mailing lists ===
> >>>>
> >>>> * pinot-private (with moderated subscriptions)
> >>>> * pinot-user
> >>>> * pinot-dev
> >>>> * pinot-commits
> >>>>
> >>>> === Git repository ===
> >>>>
> >>>> * git://git.apache.org/pinot
> >>>> * https://git-wip-us.apache.org/repos/asf/incubator-pinot.git
> >>>>
> >>>> === Issue Tracking ===
> >>>>
> >>>> A JIRA Issue tracker (PINOT)
> >>>>
> >>>> === Other Resources ===
> >>>>
> >>>> The existing code already has unit and integration tests and we use
> >>> travis
> >>>> to test the patch before committing it to master. We would like to
> have
> >>> an
> >>>> instance of Jenkins to achieve similar functionality.
> >>>>
> >>>> == Initial Committers ==
> >>>>
> >>>> * Kishore Gopalakrishna
> >>>> * Ravi Aringunram
> >>>> * Jean-François Im
> >>>> * Mayank Shrivastava
> >>>> * Subbu Subramaniam
> >>>> * Adwait Tumbde
> >>>> * Xiaotian Jiang
> >>>> * Jennifer Dai
> >>>> * Seunghyun Lee
> >>>> * Xiang Fu
> >>>> * Dhaval Patel
> >>>> * Neha Pawar
> >>>> * Alex Pucher
> >>>> * Yen-Jung Chang
> >>>>
> >>>>
> >>>>
> >>>> == Affiliations  ==
> >>>>
> >>>> * Kishore Gopalakrishna (LinkedIn)
> >>>> * Ravi Aringunram (LinkedIn)
> >>>> * Jean-François Im (LinkedIn)
> >>>> * Mayank Shrivastava (LinkedIn)
> >>>> * Subbu Subramaniam (LinkedIn)
> >>>> * Adwait Tumbde (LinkedIn)
> >>>> * Xiaotian Jiang (LinkedIn)
> >>>> * Jennifer Dai (LinkedIn)
> >>>> * Seunghyun Lee (LinkedIn)
> >>>> * Xiang Fu (Uber)
> >>>> * Dhaval Patel (Uber)
> >>>> * Neha Pawar (LinkedIn)
> >>>> * Alex Pucher (LinkedIn)
> >>>> * Yen-Jung Chang (LinkedIn)
> >>>> * Marcel Siegrist
> >>>>
> >>>> == Sponsors ==
> >>>>
> >>>> === Champion ===
> >>>>
> >>>> * Olivier Lamy < olamy at apache dot org>
> >>>>
> >>>> === Nominated Mentors ===
> >>>>
> >>>> * Olivier Lamy <olamy at apache dot org>
> >>>> * Kishore Gopalakrishna < kishoreg at apache dot org>
> >>>> *
> >>>>
> >>>> === Sponsoring Entity ===
> >>>>
> >>>> The Apache Incubator
> >>>>
> >>>
> >>>
> >>>
> >>> --
> >>> Olivier Lamy
> >>> http://twitter.com/olamy | http://linkedin.com/in/olamy
> >>>
> >>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
> For additional commands, e-mail: general-help@incubator.apache.org
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message