incubator-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Luciano Resende <luckbr1...@gmail.com>
Subject Re: [DISCUSS] Marvin-AI Incubator Proposal
Date Tue, 21 Aug 2018 04:49:30 GMT
Great, thank you... i will update the wiki and send a vote in the morning

On Mon, Aug 20, 2018 at 18:15 William Colen <colen@apache.org> wrote:

> Luciano,
>
> Now I can properly volunteer myself as a mentor.
>
> Thank you
> William
>
> Em seg, 20 de ago de 2018 às 19:37, William Colen <colen@apache.org>
> escreveu:
>
> > Luciano, I am very happy to see this incubation proposal here. I am
> > following this project for a while. Knowing the reputation on the
> > committees involved and B2W I know it can be very successful.
> > I am waiting for the board aproval for joining IPMC. Once approved I
> would
> > happly volunteer myself as a mentor.
> >
> > Em seg, 20 de ago de 2018 às 17:45, Luciano Resende <
> luckbr1975@gmail.com>
> > escreveu:
> >
> >> We are still looking for one more mentor before we put this proposal to
> a
> >> vote, any volunteers?
> >>
> >> On Wed, Aug 15, 2018 at 12:13 PM Luciano Resende <luckbr1975@gmail.com>
> >> wrote:
> >>
> >> > We would like to start a discussion on accepting Marvin-AI as an
> Apache
> >> > Incubator project.
> >> >
> >> > The proposal is available at the incubator wiki, and also copied
> below:
> >> > https://wiki.apache.org/incubator/Marvin-AI
> >> >
> >> > As part of the initial due diligence, we have done a preliminary name
> >> > search and the results are available on the JIRA below:
> >> >
> >> > https://issues.apache.org/jira/browse/PODLINGNAMESEARCH-144
> >> >
> >> > We are also looking for two additional mentors.
> >> >
> >> >
> >> > Thanks in advance for your time reviewing and providing feedback.
> >> >
> >> > ===
> >> >
> >> > = Marvin-AI =
> >> >
> >> > == Abstract ==
> >> >
> >> > Marvin-AI is an open-source artificial intelligence (AI) platform that
> >> > helps data scientists, prototype and productionalize complex solutions
> >> with
> >> > a scalable, low-latency, language-agnostic, and standardized
> >> architecture
> >> > while simplifies the process of exploration and modeling.
> >> >
> >> > == Proposal ==
> >> >
> >> > Marvin helps non-experienced developers create industry-grade AI
> >> > applications. It has three core components:  a development environment
> >> to
> >> > be used during data exploration and hypothesis validation (Toolbox), a
> >> > library which should be extended to create Marvin engines, and a Scala
> >> > application server which interprets engines (Engine Executor).
> >> > A basic premise of Marvin is that it should be language-agnostic, able
> >> to
> >> > interpret engines implemented in different programming languages.
> >> >
> >> > == Background ==
> >> >
> >> > The Marvin AI project was initiated as an internal project at B2W
> >> Digital
> >> > (Brazil), the largest e-commerce company in Latin America. Nowadays,
> it
> >> is
> >> > used by all data scientists within the B2W team. Oftentimes, data
> >> > scientists don't have an extensive background in software engineering,
> >> yet
> >> > are in charge of creating AI applications that need to scale to high
> >> > throughput and provide millisecond-level response times. At B2W,
> Marvin
> >> AI
> >> > plays an important role in this process, abstracting advanced software
> >> > engineering procedures, allowing data scientists to focus on their
> >> > knowledge domain.
> >> >
> >> > == Rationale ==
> >> >
> >> > With recent advances in computer architecture and a corresponding
> >> increase
> >> > in the amount of data generated by always-connected devices, AI
> >> algorithms
> >> > offer a solution to problems that have long troubled modern
> >> corporations.
> >> > Since AI developers come from various fields, such as statistics,
> >> physics,
> >> > and math, there exists a strong need for platforms which enable them
> to
> >> > move from prototypes to enterprise applications. Although some tools
> >> claim
> >> > to offer this service, in reality, there is no reliable open-source
> >> > solution.
> >> >
> >> > == Initial Goals ==
> >> >
> >> > The initial goals will most likely be to merge the existing codebase
> >> into
> >> > a single repository, migrate it to Apache, and then integrate with the
> >> > Apache development process. Furthermore, we plan for incremental
> >> > development and releases, as per Apache guidelines.
> >> >
> >> > == Current Status ==
> >> >
> >> > === Meritocracy ===
> >> >
> >> > Marvin already works under principles of meritocracy. Today, Marvin
> >> > already has some contributors that are part of other institutions.
> >> Although
> >> > there is no formal process defined to become a committer, contributors
> >> that
> >> > make major changes/improvements to the platform are naturally granted
> >> write
> >> > access to the repository.
> >> >
> >> >
> >> > === Community ===
> >> >
> >> > Acceptance into the Apache foundation would substantially boost both
> >> > Marvin's user and developer communities. The current community
> includes
> >> a
> >> > few experienced developers that have either academic or professional
> >> > experience with AI. The community is largely comprised of data
> >> scientists
> >> > working at B2W and other companies such as Cloudera, MIT, Qume Labs,
> >> > Laguro.com, and CBYK. Also, there is a  meetup group of hundreds of
> >> users
> >> > who meet regularly to exchange ideas about Marvin and, more generally,
> >> AI.
> >> >
> >> > Reference to the group: https://www.meetup.com/marvin-ai/members/
> >> >
> >> > === Core Developers ===
> >> >
> >> > The core developers for Marvin are listed in the contributor's list
> and
> >> > initial PPMC below. These lists include B2W employees, MIT students,
> >> UFSCAR
> >> > researchers, independent contributors, and some employees of other
> >> > companies like Cloudera, Qume Labs, Laguro.com, and CBYK.
> >> >
> >> > === Alignment ===
> >> >
> >> > The initial committers strongly believe that by being part of the
> Apache
> >> > Software Foundation, Marvin AI will be part of a comprehensive suite
> >> for AI
> >> > applications that can process big data and enable enterprises to
> extract
> >> > value from their data lakes. Also, we hope that by integrating with
> >> other
> >> > Apache projects such as Apache Spark, Apache Hadoop; that this will
> >> foster
> >> > additional collaboration between these projects furthering the already
> >> > existing integration points and expanding the community of
> contributors.
> >> >
> >> >
> >> > == Known Risks ==
> >> >
> >> > === Orphaned products ===
> >> >
> >> > Given the current maturity of Marvin and how well it has been received
> >> at
> >> > technical conferences, the risk of the project being abandoned is
> >> minimal.
> >> > AI is not academia-exclusive anymore, and as enterprises start to add
> >> > data-science pipelines to their applications, demand for Marvin will
> >> only
> >> > increase.
> >> >
> >> > === Inexperience with Open Source ===
> >> >
> >> > Marvin AI has been an open-source project since October 2017. The
> >> project
> >> > was started in a company where open-source culture is foundational.
> B2W
> >> > Digital runs the largest e-commerce in Latin America on top of
> >> open-source
> >> > projects.
> >> >
> >> > === Reliance on Salaried Developers ===
> >> >
> >> > Marvin AI receives substantial efforts from salaried developers -- a
> few
> >> > of which were hired by companies to work exclusively for the project
> --
> >> but
> >> > the majority devote "after-hours" or spare time to this project. Some
> >> > developers are graduate students that contribute in their free time at
> >> > school.
> >> >
> >> > === Relationships with Other Apache Products ===
> >> >
> >> > Marvin integrates with several Apache products, such as Hadoop (HDFS)
> >> and
> >> > Spark. Marvin shares some similar features with PredictionIO,
> >> specifically
> >> > the model application server and a design pattern that was inspired by
> >> the
> >> > DASE. Despite these similarities, Marvin is catered towards a
> different
> >> > clientele (data scientists), and for that reason, it includes many
> >> critical
> >> > features that are not provided by PredictionIO.
> >> >
> >> > === An Excessive Fascination with the Apache Brand ===
> >> >
> >> > While the ASF brand will undoubtedly help Marvin become a successful
> >> > project, Marvin is already gaining traction at companies around the
> >> globe.
> >> >
> >> > == Documentation ==
> >> >
> >> > http://www.marvin-ai.org
> >> >
> >> >
> >> > == Initial Source ==
> >> >
> >> > The current codebase is available at http://github.com/marvin-ai.
> This
> >> is
> >> > practically the same code that will be migrating to the Apache
> >> Foundation,
> >> > the notable difference being that the multiple repositories will be
> >> merged
> >> > into a single repository (if necessary).
> >> >
> >> > These are the main repositories and a very simplified explanation
> about
> >> > each one:
> >> >
> >> > '''Main repositories'''
> >> >
> >> >  * marvin-ai/marvin-python-toolbox - Data Science toolbox that helps
> in
> >> > the creation of new ML engines
> >> >  * marvin-ai/marvin-engine-executor - Component responsible for
> >> > interpreting, serving and managing Marvin engines
> >> >  * marvin-ai/marvin-public-engines - Marvin engine examples to help
> new
> >> > Marvin users to build engines
> >> >  * marvin-ai/marvin-platform-book - Documentation in GitHub book site
> >> > format
> >> >
> >> > '''Secondary repositories (Experimental and Initial)'''
> >> >  * marvin-ai/marvin-vagrant-dev - Development environment that uses
> >> > VirtualBox and vagrant to non mac and Linux users;
> >> >  * marvin-ai/marvin-paper - Source code (latex format) of the first
> >> Marvin
> >> > paper published in PAPIS.io conference in Boston.
> >> >  * marvin-ai/marvin-cluster-admin - Admin module responsible to manage
> >> > Marvin cluster;
> >> >  * marvin-ai/marvin-automl - AutoML module responsible to help data
> >> > scientist to build machine learning models with a very simple visual
> >> > interface;
> >> >
> >> >
> >> > == External Dependencies ==
> >> >
> >> > It is very likely that all our dependencies are using either the
> Apache
> >> or
> >> > MIT license. Upon acceptance to the incubator, we would begin a
> thorough
> >> > analysis of all transitive dependencies to verify this fact and
> >> introduce
> >> > license checking into the build and release process.
> >> >
> >> > == Required Resources ==
> >> >
> >> > === Mailing lists ===
> >> >
> >> >   * private@marvin-ai.incubator.apache.org (with moderated
> >> subscriptions)
> >> >   * dev@marvin-ai.incubator.apache.org
> >> >   * commits@marvin-ai.incubator.apache.org
> >> >
> >> >
> >> > === Git Repositories ===
> >> >
> >> >   * https://git-wip-us.apache.org/repos/asf/incubator-marvin-ai.git
> >> >
> >> > === Issue Tracking ===
> >> >
> >> >   * JIRA (MARVIN)
> >> >
> >> > == Initial Committers ==
> >> >
> >> >  * Lucas Bonatto Miguel <lucasbonatto@gmail.com> - Qume Labs
> >> (California
> >> > - USA)
> >> >  * Daniel Takabayashi <daniel.takabayashi@gmail.com> - B2W Digital
> (São
> >> > Paulo - BR) / Laguro.com (California - USA)
> >> >  * Bruno Piraja <bruno.piraja@b2wdigital.com> - B2W Digital (São
> Paulo
> >> -
> >> > BR)
> >> >  * Zhang Yifei <zhang.yifei@b2wdigital.com> - B2W Digital (São Paulo
> -
> >> BR)
> >> >  * Harrison Wang <hwang123@mit.edu> - MIT (USA)
> >> >  * Brody West <brodyw@mit.edu> - MIT (USA)
> >> >  * Rafael Novello <rafael.novello@b2wdigital.com> - B2W Digital (São
> >> > Paulo - BR)
> >> >  * Willian Leite <willian.leite@cbyk.com.br> - CBYK (São Paulo -
BR)
> >> >  * Danilo Nunes <nunesdanilo@gmail.com> - Qume Labs (California -
> USA)
> >> >  * Alan Silva <alan.silva@cloudera.com> Cloudera (USA)
> >> >  * Jeremy Elster <jeremy.elster@b2wdigital.com> - B2W Digital (São
> >> Paulo
> >> > - BR)
> >> >
> >> >
> >> > == Sponsors ==
> >> >
> >> > === Champion ===
> >> >
> >> >  * Luciano Resende - (lresende)
> >> >
> >> > === Nominated Mentors ===
> >> >
> >> >  * Luciano Resende - (lresende)
> >> >
> >> > === Sponsoring Entity ===
> >> > We would like to propose the Apache Incubator to sponsor this project.
> >> >
> >> > --
> >> > Luciano Resende
> >> > http://twitter.com/lresende1975
> >> > http://lresende.blogspot.com/
> >> >
> >>
> >>
> >> --
> >> Luciano Resende
> >> http://twitter.com/lresende1975
> >> http://lresende.blogspot.com/
> >>
> >
>
-- 
Sent from my Mobile device

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message