Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Running machine learning locally. Should I use local instance of Datalab, or Jupyter?

I'm doing some machine learning and data analysis on data from Google Analytics and other sources.

I've managed to deploy Cloud Datalab locally and connect to my BigQuery, however I am not sure if this is the best way to do things. I can see that just using vanilla Jupyter notebooks with Pandas I can still connect to BigQuery. Regular Jupyter has the advantage that it runs without Docker, and also has Python 3.

So I'm wondering if there's any benefit to doing this with Cloud Datalab locally besides SQL syntax highlighting? In short, are all the benefits of Cloud Datalab relevant only for cloud computing, or does it bring any advantages over Jupyter for local deployments too?

Thanks!

like image 921
Tom Avatar asked Dec 05 '25 20:12

Tom


1 Answers

Even if you are using regular Jupyter, you can also still install the Datalab python package to use most of the datalab functionality.

My reasons for using Datalab over Jupyter when running locally are:

  1. Running a docker brings a well-tested environment.
  2. PyDatalab brings BigQuery APIs and magics, which create a good BigQuery playground. google.datalab.bigquery offers more than just creating a dataframe out of query.
  3. BigQuery's integration with charting (%%chart can take BQ queries).
  4. Machine learning tools and MLToolbox.
  5. Different UI.

Jupyter+The Datalab Package gives you 2, 3, and 4, though.

like image 147
Chris Meyers Avatar answered Dec 10 '25 00:12

Chris Meyers



Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!