Are you into Formula 1? Take a look at their data

Auto racing has embraced Big Data and data science.  This web page at AWS discusses their partnership with Formula 1, with each race car generating over 1.1 million data points per second that are transmitted from the car to the pits.  If you want to play around with some of the data, there is a Python module named Fast F1 That will provide access to data and has examples of some analysis you can do with the data.  Another example using this data is covered in this Medium post.

A different set of historical data on Formula 1 is also available through Kaggle.


What is Data Science?

If you Google for a definition, you will no doubt find Venn diagrams with three circles (Venn diagrams are those figures you have seen with overlapping circles showing the relationships between things).  If you Google for “data science Venn diagram”, you will find some where folks went wild and have a dozen overlapping circles, but most have 3 overlapping circles (maybe because people who write definitions, such as academics, always seem to want to describe things as three-legged stools).

Recently Datanami had an interview with Jeffrey Ullman from Stanford who is a big name in computer science (particularly databases), and in 2020 won the Turing Award (think Nobel prize in computer science).  The article is short, but interesting, and he points out that everyone has their own diagram that emphasizes their domain!

You can find the article at this link.

Google Cloud Next ’21 is coming up next week, virtual, and free

Google has a cloud computing event on October 12-14th that has sessions on AI & ML (which may be of interest if you took the machine learning seminar), data analytics (which may be interesting if you found any of the Data Science for All seminars interesting), and other topics ranging from Databases to Diversity, Equity & Inclusion, which may be of particular interest if you are participating in the DEI Summit that the MISA group here at SJSU is hosting today!

I noticed the conference because there is going to be a Kaggle session titled, “The State of Data Science and Machine Learning 2021” on October 14th at 3:30 PST that will include Julia Eliott (Kaggle’s interim CEO), Kaggle grandmasters, and others.  If you participated in the free Kaggle machine learning course we posted about this past summer, this may be of interest to you.

To Register for the event, go to:

Microsoft Ignite (Virtual) Conference Online (and free) November 2-4

Microsoft is hosting a technology conference online called Microsoft Ignite on November 2-4 and the registration is free.

You can learn more and register at the following URL:

Since it is a Microsoft conference, the speakers and presentations are focused on Microsoft, but if you are trying to learn some of their tools, they have a “learning zone” and training on some of their products.

Since it’s free, you may as well check it out!

Tableau Conference ’21 (Nov. 9 – 12) – Registration Open (virtual and free)

The registration for Tableau Conference 21 is now open, virtual, and completely free!  You can register at:

The conference is online in November from Tuesday, 9th – Friday, 12th 9am-3pm SJSU time (a.k.a. Pacific time).  In case you are at your chateau in France that week, they also have broadcast times slightly delayed for Europe and Asia Pacific.

There are a number of speakers and discussions (keynote speakers were not announced yet, other than their CEO), but if you sign up early, you can reserve a spot to talk with one of their “Tableau Doctors” to help diagnose visualization problems vexing you or a project you are working on.  Also, if there is a talk that interests you, you can also join a live Q&A session with the speakers and ask questions.