Gradient Flow #43: Graph Databases; Language Understanding; Program Synthesis

Subscribe • Previous Issues

“Done is better than perfect.” - Sheryl Sandberg

Data Exchange podcast

  • The State of Data Journalism    A conversation with Tara Kelly, Data Editor at DataJournalism.com (DJC) an organization created by the European Journalism Centre. DJC provides journalists and media groups with free resources, materials, online video courses and community forums. 

  • Why Graph Databases and Graph Analytics are hot again   Our friend Paco Nathan has been doing a lot of work with graphs and as such he’s had to immerse himself in the world of graph data management. This conversation is focused on what’s new with graph databases, use cases of graph databases, graph analytics, and graph neural networks.


Featured FREE Virtual conference

I’m once again the co-chair of the NLP Summit and we have another great lineup for you this year. We have speakers and case studies from leading organizations including Hugging Face, Stanford NLP and Stanza, Spark NLP, Morgan Stanley, Microsoft Research, Eleuther, and AI21 Labs - creators of the largest language model available to developers.

REGISTER


Data & Machine Learning Tools and Infrastructure

  • The Data Lakehouse :: FAQ    A data management paradigm that we first introduced last year is quietly and steadily gaining traction.

  • Data Validation Tool   In our soon to be released Data Engineering Survey, respondents cited Data Quality and Data Validation as one of the key challenges facing their data teams. This newly open sourced library from Google is a Python tool that provides an automated and repeatable solution for data validation across different environments. 

  • Darts   An open source Python library for easy manipulation and forecasting of time series. Among other things, Darts lowers the barrier for using deep learning models for forecasting and allows you to train on multiple (thousands or more) of possibly multi-dimensional time series.

  • Whale: Scaling Deep Learning Model Training to the Trillions

  • River   An open source Python library for online machine learning.

[Image: Examples of real-world situations that can cause a model to degrade]

Recommendations


Closing short: Solo Band


If you enjoyed this newsletter please support our work by encouraging your friends and colleagues to subscribe:


Ben Lorica edits the Gradient Flow newsletter. He is co-chair of the Ray Summit, external chair of the NLP Summit, and host of the Data Exchange podcast. You can follow him on Twitter @BigData. This newsletter is produced by Gradient Flow.

Loading more posts…