Crayon Data
  • Vision
  • Platform
    • Modular Platform Construct
      • ModulesModulesFour components for revenue growth
      • APIsAPIsBuilding blocks of maya.ai’s magic
      • IntegrationsIntegrationsWork seamlessly with platforms and products
    • Built for Scale, Security and Trust
      • ScaleScaleCloud agnostic to scale with ease
      • Patented AIPatented AIReal time recommendations based on tastes
      • Security and PrivacySecurity and PrivacyHow we keep data safe and sound
  • Solutions
    • Verticals
      • Consumer BankConsumer BankDrive customer engagement for revenue growth
      • FintechFintechJoin the digital payment revolution with ease
      • TravelTravelIncrease share of travel wallet with personalization
      • B2B MarketplaceTech DistributionTech products and recommendations to drive sales
      • Merchant MarketplaceRetailWhere the right merchants meet the right customers
    • Resources
      • Where maya.ai innovation becomes tangible with real-life use cases, and ready-to-use demos. Dive in.
      • Use CasesUse Casesmaya.ai’s unique solutions for everything from data to CX
  • Ecosystem
    • PartnersPartners
    • ClientsClients
    • Slaves to the AlgoSlaves to the Algo
  • Life at Crayon
    • Join UsJoin Us
    • Our ValuesOur Values
  • Company
    • About UsAbout Us
    • Company VisionCompany Vision
    • Our TeamOur Team
    • Our InvestorsOur Investors
    • In the LimelightIn the Limelight
  • Enquire now
Select Page

Most influential research papers every data scientist should read!

Data Science   |   
Published August 11, 2014   |   

This is a list of some of the most influential papers in the history of Data Science. We’ve compiled these papers based on recommendations by big data enthusiasts in various social media channels. In case, we’ve missed out any important paper, let us know.
The PageRank Citation Ranking: Bringing Order to the Web
MapReduce: Simplified Data Processing on Large Clusters
The Google File System
Amazon’s Dynamo
Bigtable: A Distributed Storage System for Structured Data
A Few Useful Things to Know about Machine Learning
Random Forests
A Relational Model of Data for Large Shared Data Banks
Map-Reduce for Machine Learning on Multicore
Pasting Small Votes for Classification in Large Databases and On-Line
Recommendations Item-to-Item Collaborative Filtering
Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank
Spanner: Google’s Globally-Distributed Database
Megastore: Providing Scalable, Highly Available Storage for Interactive Services
F1: A Distributed SQL Database That Scales
APACHE DRILL: Interactive Ad-Hoc Analysis at Scale
A New Approach to Linear Filtering and Prediction Problems
Top 10 algorithms on Data mining

Subscribe to the Crayon Blog

Get the latest posts in your inbox!

Sign up here
Vision

Platform

Modules

APIs

Integrations

Scale

Patented AI

Security and Privacy

Solutions

Consumer Bank

Fintech

Travel

Tech Distribution

Retail

Use Cases

Ecosystem

Partners

Clients

Slaves to the Algo

Life at Crayon

Join Us

Our Values

Company

About Us

Company Vision

Our Team

Our Investors

In the Limelight

Crayon Logo
AI-led customer experience platform for revenue acceleration

Singapore  |  India  |  Dubai  |  USA

 

Crayon Data Pte Ltd
18 Cross Street, #02-101
Singapore 048423

 

Crayon Data India Pvt Ltd
5th Floor, Module 53 & 51,
Software Block, Elnet Software City,
TS 140, Block 2&9, Rajiv Gandhi Road,
Taramani, Chennai – 600 113
Tamil Nadu, India

  • Follow
  • Follow
  • Follow
  • Follow
  • Follow

© 2023 Crayon Data Pvt Ltd.  All Rights Reserved

 

Privacy Policy      Cookie Policy