incubator-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From sebb <seb...@gmail.com>
Subject Re: [DISCUSS] Marvin-AI Incubator Proposal
Date Wed, 15 Aug 2018 23:16:43 GMT
On 15 August 2018 at 23:37, Luciano Resende <luckbr1975@gmail.com> wrote:
> Thanks for the heads up Sebastian, we will have that in mind and remove the
> hyphen if that's what causes the issue (e.g. using to MarvinAI)

Or just Marvin?

Note that the display name can remain as Marvin-AI, however the
project id/mailing-list should not have an embedded hyphen.

> On Wed, Aug 15, 2018 at 3:29 PM sebb <sebbaz@gmail.com> wrote:
>
>> A warning: there are some areas of Infra which assume that project
>> names don't include hyphens.
>> There is one existing project with a hyphen (Empire-db); various
>> scripts have to be special-cased to handle this.
>>
>> It would be best not to use marvin-ai as the project id (the display
>> name does not matter).
>>
>> On 15 August 2018 at 20:13, Luciano Resende <luckbr1975@gmail.com> wrote:
>> > We would like to start a discussion on accepting Marvin-AI as an Apache
>> > Incubator project.
>> >
>> > The proposal is available at the incubator wiki, and also copied below:
>> > https://wiki.apache.org/incubator/Marvin-AI
>> >
>> > As part of the initial due diligence, we have done a preliminary name
>> > search and the results are available on the JIRA below:
>> >
>> > https://issues.apache.org/jira/browse/PODLINGNAMESEARCH-144
>> >
>> > We are also looking for two additional mentors.
>> >
>> >
>> > Thanks in advance for your time reviewing and providing feedback.
>> >
>> > ===
>> >
>> > = Marvin-AI =
>> >
>> > == Abstract ==
>> >
>> > Marvin-AI is an open-source artificial intelligence (AI) platform that
>> > helps data scientists, prototype and productionalize complex solutions
>> with
>> > a scalable, low-latency, language-agnostic, and standardized architecture
>> > while simplifies the process of exploration and modeling.
>> >
>> > == Proposal ==
>> >
>> > Marvin helps non-experienced developers create industry-grade AI
>> > applications. It has three core components:  a development environment to
>> > be used during data exploration and hypothesis validation (Toolbox), a
>> > library which should be extended to create Marvin engines, and a Scala
>> > application server which interprets engines (Engine Executor).
>> > A basic premise of Marvin is that it should be language-agnostic, able to
>> > interpret engines implemented in different programming languages.
>> >
>> > == Background ==
>> >
>> > The Marvin AI project was initiated as an internal project at B2W Digital
>> > (Brazil), the largest e-commerce company in Latin America. Nowadays, it
>> is
>> > used by all data scientists within the B2W team. Oftentimes, data
>> > scientists don't have an extensive background in software engineering,
>> yet
>> > are in charge of creating AI applications that need to scale to high
>> > throughput and provide millisecond-level response times. At B2W, Marvin
>> AI
>> > plays an important role in this process, abstracting advanced software
>> > engineering procedures, allowing data scientists to focus on their
>> > knowledge domain.
>> >
>> > == Rationale ==
>> >
>> > With recent advances in computer architecture and a corresponding
>> increase
>> > in the amount of data generated by always-connected devices, AI
>> algorithms
>> > offer a solution to problems that have long troubled modern corporations.
>> > Since AI developers come from various fields, such as statistics,
>> physics,
>> > and math, there exists a strong need for platforms which enable them to
>> > move from prototypes to enterprise applications. Although some tools
>> claim
>> > to offer this service, in reality, there is no reliable open-source
>> > solution.
>> >
>> > == Initial Goals ==
>> >
>> > The initial goals will most likely be to merge the existing codebase
>> into a
>> > single repository, migrate it to Apache, and then integrate with the
>> Apache
>> > development process. Furthermore, we plan for incremental development and
>> > releases, as per Apache guidelines.
>> >
>> > == Current Status ==
>> >
>> > === Meritocracy ===
>> >
>> > Marvin already works under principles of meritocracy. Today, Marvin
>> already
>> > has some contributors that are part of other institutions. Although there
>> > is no formal process defined to become a committer, contributors that
>> make
>> > major changes/improvements to the platform are naturally granted write
>> > access to the repository.
>> >
>> >
>> > === Community ===
>> >
>> > Acceptance into the Apache foundation would substantially boost both
>> > Marvin's user and developer communities. The current community includes a
>> > few experienced developers that have either academic or professional
>> > experience with AI. The community is largely comprised of data scientists
>> > working at B2W and other companies such as Cloudera, MIT, Qume Labs,
>> > Laguro.com, and CBYK. Also, there is a  meetup group of hundreds of users
>> > who meet regularly to exchange ideas about Marvin and, more generally,
>> AI.
>> >
>> > Reference to the group: https://www.meetup.com/marvin-ai/members/
>> >
>> > === Core Developers ===
>> >
>> > The core developers for Marvin are listed in the contributor's list and
>> > initial PPMC below. These lists include B2W employees, MIT students,
>> UFSCAR
>> > researchers, independent contributors, and some employees of other
>> > companies like Cloudera, Qume Labs, Laguro.com, and CBYK.
>> >
>> > === Alignment ===
>> >
>> > The initial committers strongly believe that by being part of the Apache
>> > Software Foundation, Marvin AI will be part of a comprehensive suite for
>> AI
>> > applications that can process big data and enable enterprises to extract
>> > value from their data lakes. Also, we hope that by integrating with other
>> > Apache projects such as Apache Spark, Apache Hadoop; that this will
>> foster
>> > additional collaboration between these projects furthering the already
>> > existing integration points and expanding the community of contributors.
>> >
>> >
>> > == Known Risks ==
>> >
>> > === Orphaned products ===
>> >
>> > Given the current maturity of Marvin and how well it has been received at
>> > technical conferences, the risk of the project being abandoned is
>> minimal.
>> > AI is not academia-exclusive anymore, and as enterprises start to add
>> > data-science pipelines to their applications, demand for Marvin will only
>> > increase.
>> >
>> > === Inexperience with Open Source ===
>> >
>> > Marvin AI has been an open-source project since October 2017. The project
>> > was started in a company where open-source culture is foundational. B2W
>> > Digital runs the largest e-commerce in Latin America on top of
>> open-source
>> > projects.
>> >
>> > === Reliance on Salaried Developers ===
>> >
>> > Marvin AI receives substantial efforts from salaried developers -- a few
>> of
>> > which were hired by companies to work exclusively for the project -- but
>> > the majority devote "after-hours" or spare time to this project. Some
>> > developers are graduate students that contribute in their free time at
>> > school.
>> >
>> > === Relationships with Other Apache Products ===
>> >
>> > Marvin integrates with several Apache products, such as Hadoop (HDFS) and
>> > Spark. Marvin shares some similar features with PredictionIO,
>> specifically
>> > the model application server and a design pattern that was inspired by
>> the
>> > DASE. Despite these similarities, Marvin is catered towards a different
>> > clientele (data scientists), and for that reason, it includes many
>> critical
>> > features that are not provided by PredictionIO.
>> >
>> > === An Excessive Fascination with the Apache Brand ===
>> >
>> > While the ASF brand will undoubtedly help Marvin become a successful
>> > project, Marvin is already gaining traction at companies around the
>> globe.
>> >
>> > == Documentation ==
>> >
>> > http://www.marvin-ai.org
>> >
>> >
>> > == Initial Source ==
>> >
>> > The current codebase is available at http://github.com/marvin-ai. This
>> is
>> > practically the same code that will be migrating to the Apache
>> Foundation,
>> > the notable difference being that the multiple repositories will be
>> merged
>> > into a single repository (if necessary).
>> >
>> > These are the main repositories and a very simplified explanation about
>> > each one:
>> >
>> > '''Main repositories'''
>> >
>> >  * marvin-ai/marvin-python-toolbox - Data Science toolbox that helps in
>> the
>> > creation of new ML engines
>> >  * marvin-ai/marvin-engine-executor - Component responsible for
>> > interpreting, serving and managing Marvin engines
>> >  * marvin-ai/marvin-public-engines - Marvin engine examples to help new
>> > Marvin users to build engines
>> >  * marvin-ai/marvin-platform-book - Documentation in GitHub book site
>> format
>> >
>> > '''Secondary repositories (Experimental and Initial)'''
>> >  * marvin-ai/marvin-vagrant-dev - Development environment that uses
>> > VirtualBox and vagrant to non mac and Linux users;
>> >  * marvin-ai/marvin-paper - Source code (latex format) of the first
>> Marvin
>> > paper published in PAPIS.io conference in Boston.
>> >  * marvin-ai/marvin-cluster-admin - Admin module responsible to manage
>> > Marvin cluster;
>> >  * marvin-ai/marvin-automl - AutoML module responsible to help data
>> > scientist to build machine learning models with a very simple visual
>> > interface;
>> >
>> >
>> > == External Dependencies ==
>> >
>> > It is very likely that all our dependencies are using either the Apache
>> or
>> > MIT license. Upon acceptance to the incubator, we would begin a thorough
>> > analysis of all transitive dependencies to verify this fact and introduce
>> > license checking into the build and release process.
>> >
>> > == Required Resources ==
>> >
>> > === Mailing lists ===
>> >
>> >   * private@marvin-ai.incubator.apache.org (with moderated
>> subscriptions)
>> >   * dev@marvin-ai.incubator.apache.org
>> >   * commits@marvin-ai.incubator.apache.org
>> >
>> >
>> > === Git Repositories ===
>> >
>> >   * https://git-wip-us.apache.org/repos/asf/incubator-marvin-ai.git
>> >
>> > === Issue Tracking ===
>> >
>> >   * JIRA (MARVIN)
>> >
>> > == Initial Committers ==
>> >
>> >  * Lucas Bonatto Miguel <lucasbonatto@gmail.com> - Qume Labs
>> (California -
>> > USA)
>> >  * Daniel Takabayashi <daniel.takabayashi@gmail.com> - B2W Digital (São
>> > Paulo - BR) / Laguro.com (California - USA)
>> >  * Bruno Piraja <bruno.piraja@b2wdigital.com> - B2W Digital (São Paulo
>> - BR)
>> >  * Zhang Yifei <zhang.yifei@b2wdigital.com> - B2W Digital (São Paulo
-
>> BR)
>> >  * Harrison Wang <hwang123@mit.edu> - MIT (USA)
>> >  * Brody West <brodyw@mit.edu> - MIT (USA)
>> >  * Rafael Novello <rafael.novello@b2wdigital.com> - B2W Digital (São
>> Paulo
>> > - BR)
>> >  * Willian Leite <willian.leite@cbyk.com.br> - CBYK (São Paulo - BR)
>> >  * Danilo Nunes <nunesdanilo@gmail.com> - Qume Labs (California - USA)
>> >  * Alan Silva <alan.silva@cloudera.com> Cloudera (USA)
>> >  * Jeremy Elster <jeremy.elster@b2wdigital.com> - B2W Digital (São
>> Paulo -
>> > BR)
>> >
>> >
>> > == Sponsors ==
>> >
>> > === Champion ===
>> >
>> >  * Luciano Resende - (lresende)
>> >
>> > === Nominated Mentors ===
>> >
>> >  * Luciano Resende - (lresende)
>> >
>> > === Sponsoring Entity ===
>> > We would like to propose the Apache Incubator to sponsor this project.
>> >
>> > --
>> > Luciano Resende
>> > http://twitter.com/lresende1975
>> > http://lresende.blogspot.com/
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
>> For additional commands, e-mail: general-help@incubator.apache.org
>>
>>
>
> --
> Luciano Resende
> http://twitter.com/lresende1975
> http://lresende.blogspot.com/

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Mime
View raw message