Top applications across the top players

Amazon

  • Frequently Bought Together
  • Customers Who Bought This Item Also Bought

Netflix

  • Movie Recommendation

Facebook

  • People You Might Know (aka friend suggestions)
  • Face detection

Bing

  • Entity extraction from web pages and queries, such as names and addresses. It ran inside the IE toolbar, Bing index generation, and query processing.

Google

  • Click Fraud Detection

Yelp

NLP API Services

Deep Learning Datasets

Interesting Public Datasets

There are quite a few ML competitions on Kaggle, and each of them releases a good amount of data to the public. Here is the list of datasets that I found interesting.

Entertainment Datasets

A set of celebrity, image, and movie data is listed below. It covers about 1000 to 2000 celebrities. You can cross-check it against People.com for completeness.
* Celebrity Face on Web from Microsoft
* Celebrity Twitter Accounts – more than 1000 celebrity Twitter accounts.
* Cross-Age Celebrity Dataset (CACD)

Proper Name

eCommerce

Internet Marketing

Finance

Other Interesting DataSets

Reference
