
Scaling Big Data Mining Infrastructure at Twitter


Published: 2016-06-07 16:36:16


I almost always enjoy the lessons-learned-style presentations from Twitter’s people. The slides below, by Jimmy Lin and Dmitriy Ryaboy, were presented at Hadoop Summit. Besides the technical and practical details, there are two things that I really like:

DJ Patil: “It’s impossible to overstress this: 80% of the work in any data project is in cleaning the data”

and then the reality check:

Your boss says something vague
You think very hard on how to move the needle
Where’s the data?
What’s in this dataset?
What’s all the f#$#$ crap in the data?
Clean the data
Run some off-the-shelf data mining algorithm
…
Productionize, act on the insight
Rinse, repeat

Enjoy!

Scaling Big Data Mining Infrastructure Twitter Experience

Original title and link: Scaling Big Data Mining Infrastructure at Twitter (NoSQL database©myNoSQL)


Original source: Scaling Big Data Mining Infrastructure at Twitter. Thanks to the original author for sharing.
