“Done is better than perfect.” - Sheryl Sandberg
Data Exchange podcast
The State of Data Journalism A conversation with Tara Kelly, Data Editor at DataJournalism.com (DJC), an organization created by the European Journalism Centre. DJC provides journalists and media groups with free resources, materials, online video courses, and community forums.
Why Graph Databases and Graph Analytics are hot again Our friend Paco Nathan has been doing a lot of work with graphs, and as such he’s had to immerse himself in the world of graph data management. This conversation focuses on what’s new with graph databases, their use cases, graph analytics, and graph neural networks.
Featured FREE Virtual conference
I’m once again the co-chair of the NLP Summit, and we have another great lineup for you this year. We have speakers and case studies from leading organizations including Hugging Face, Stanford NLP and Stanza, Spark NLP, Morgan Stanley, Microsoft Research, Eleuther, and AI21 Labs, creators of the largest language model available to developers.
Data & Machine Learning Tools and Infrastructure
Data Validation Tool In our soon-to-be-released Data Engineering Survey, respondents cited Data Quality and Data Validation among the key challenges facing their data teams. This newly open-sourced library from Google is a Python tool that provides an automated and repeatable solution for data validation across different environments.
Darts An open source Python library for easy manipulation and forecasting of time series. Among other things, Darts lowers the barrier to using deep learning models for forecasting and lets you train on many (thousands or more) time series, which may be multi-dimensional.
River An open source Python library for online machine learning.
Program Synthesis with Large Language Models This new paper from Google Research investigates whether large language models can be used to synthesize code in a general-purpose language → “it is worth emphasizing that we are a long way from models that can synthesize complex applications without human supervision”.
Prettymaps Python library for drawing gorgeous customized maps from OpenStreetMap data.
Machine Learning - A First Course for Engineers and Scientists FREE preliminary version of an upcoming book based on a course at Uppsala University.
Closing short: Solo Band
If you enjoyed this newsletter, please support our work by encouraging your friends and colleagues to subscribe.
Ben Lorica edits the Gradient Flow newsletter. He is co-chair of the Ray Summit, external chair of the NLP Summit, and host of the Data Exchange podcast. You can follow him on Twitter @BigData. This newsletter is produced by Gradient Flow.