madlib-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "FENG, Xixuan (Aaron)" <xixuan.f...@gmail.com>
Subject [REPORT] MADlib - July 2019
Date Tue, 09 Jul 2019 07:50:26 GMT
## Description:

- Apache MADlib is a scalable, big data, SQL-driven machine learning
framework for data scientists.


## Issues:

- There are no issues requiring board attention at this time.


## Activity:

- Last release was 1.16 which was the 6th release as
an Apache TLP project.  This was a significant release
that included initial support for distributed training
of deep learning models with GPU acceleration, utilities
to load model architectures and weights, preprocessing
of images for mini-batch gradient descent, and support
for Greenplum 6 and PostgreSQL 11.  Plus the usual bug
fixes and minor improvements.

- Community is at work on the 1.17 release.  Scope is
still being decided by the community, but JIRAs
call for improvements to deep learning as a follow on
to 1.16, and improvements to correlation/covariance,
association rules and decision tree.

- After that will be the 2.0 release with JIRAs related
to versioning models.

- Frank McQuillan (MADlib committer and PMC member) presented
at Dell Tech World on 2019-Apr-30 on MADlib and Greenplum Database
in a talk called "AI in a Box".

- Yuhao Zhang, a PhD candidate at University of California, San Diego
is doing an internship at Pivotal in Palo Alto to work on
parameter selection in MADlib, which is an important area for
deep learning practitioners.  Yuhao's advisor at UCSD is Arun Kumar
in the Department of Computer Science and Engineering, whose
research has contributed to MADlib in the past.

## Health report:

The community is relatively small but very engaged with robust mailing
list traffic, interest in doing frequent releases and new
functionality being developed by contributors.

The number of developers actively contributing to the code/documentation
is approximately 8 in the 2nd quarter of calendar year 2019.

We will constantly be on a lookout for new community members to be
invited either as committers or PMC.


## PMC changes:

- No changes in the last quarter.  Currently stands at 14 PMC members.


## Committer base changes:

- Currently 14 committers.

- Last committer additions were Jingyi Mei on
2018-06-14 and Nikhil Kak on 2018-06-27.


## Releases:

- Next release: v1.17 planned for 3Q2019

- v1.16.0 released on 2019-07-08

- v1.15.1 released on 2018-10-15

- v1.15.0 released on 2018-08-10


## Mailing list activity:

Average monthly mailing list activity was 620 posts to dev@
and 11 posts to user@ for the last 3 months Apr-Jun.


## JIRA Statistics:

- 12 JIRA tickets created in the last month

- 13 JIRA tickets resolved in the last month

Mime
View raw message