Kaggle Competition Jigsaw Rate Severity of Toxic Comments

Jigsaw Rate Severity of Toxic Comments

The definition of a toxic comment on the Internet is subjective. Each individual may have their own bar set differently. But which comment is truly worse? Can you help determine the ‘severity’ of a comment?

In this competition, Jigsaw returns to the discussions from Wikipedia Talk pages. You will score a set of about fourteen thousand comments. Your scores will be compared to human rankings performed on comment pairs. In this way, the focus is on the severity of comment toxicity — from innocuous to outrageous, where the middle matters as much as the extremes.

Total Prizes: $50,000

Entry Deadline: January 31, 2022

Kaggle Competitions

Are you interested in testing your data science skills? Kaggle offers a no-setup, customizable, Jupyter Notebooks environment. Access free GPUs and a huge repository of community published data & code. Thousands of data sets are available for exploration and data play. Kaggle also partners with other organizations to host competitions, some for learning, such as this House Price prediction competition, and others offer cash prizes, such as this NFL Big Data Bowl 2022 that investigates special teams performance. Kaggle even offers free micro courses on Pandas, Machine Learning and more!

You can register for a free Kaggle account using any Gmail.

 

What is Data Science?

If you Google for a definition, you will no doubt find Venn diagrams with three circles (Venn diagrams are those figures you have seen with overlapping circles showing the relationships between things).  If you Google for “data science Venn diagram”, you will find some where folks went wild and have a dozen overlapping circles, but most have 3 overlapping circles (maybe because people who write definitions, such as academics, always seem to want to describe things as three-legged stools).

Recently Datanami had an interview with Jeffrey Ullman from Stanford who is a big name in computer science (particularly databases), and in 2020 won the Turing Award (think Nobel prize in computer science).  The article is short, but interesting, and he points out that everyone has their own diagram that emphasizes their domain!

You can find the article at this link.

Google Cloud Next ’21 is coming up next week, virtual, and free

Google has a cloud computing event on October 12-14th that has sessions on AI & ML (which may be of interest if you took the machine learning seminar), data analytics (which may be interesting if you found any of the Data Science for All seminars interesting), and other topics ranging from Databases to Diversity, Equity & Inclusion, which may be of particular interest if you are participating in the DEI Summit that the MISA group here at SJSU is hosting today!

I noticed the conference because there is going to be a Kaggle session titled, “The State of Data Science and Machine Learning 2021” on October 14th at 3:30 PST that will include Julia Eliott (Kaggle’s interim CEO), Kaggle grandmasters, and others.  If you participated in the free Kaggle machine learning course we posted about this past summer, this may be of interest to you.

To Register for the event, go to: https://cloud.withgoogle.com/next