Home Database Mysql Tutorial 数据挖掘方面重要会议的最佳paper集合

数据挖掘方面重要会议的最佳paper集合

Jun 07, 2016 pm 03:56 PM
data mining optimal gather

数据挖掘方面重要会议的最佳paper集合,后续将陆续分析一下内容: 主要有KDD、SIGMOD、VLDB、ICML、SIGIR KDD (Data Mining) 2013 Simple and Deterministic Matrix Sketching Edo Liberty, Yahoo! Research 2012 Searching and Mining Trillions of Time Se

数据挖掘方面重要会议的最佳paper集合,后续将陆续分析一下内容:

主要有KDD、SIGMOD、VLDB、ICML、SIGIR

KDD (Data Mining)

2013

Simple and Deterministic Matrix Sketching

Edo Liberty, Yahoo! Research

2012

Searching and Mining Trillions of Time Series Subsequences under Dynamic Time Warping

Thanawin Rakthanmanon, University of California Riverside; et al.

2011

Leakage in Data Mining: Formulation, Detection, and Avoidance

Shachar Kaufman, Tel-Aviv University; et al.

2010

Large linear classification when data cannot fit in memory

Hsiang-Fu Yu, National Taiwan University; et al.

Connecting the dots between news articles

Dafna Shahaf & Carlos Guestrin, Carnegie Mellon University

2009

Collaborative Filtering with Temporal Dynamics

Yehuda Koren, Yahoo! Research

2008

Fastanova: an efficient algorithm for genome-wide association study

Xiang Zhang, University of North Carolina at Chapel Hill; et al.

2007

Predictive discrete latent factor models for large scale dyadic data

Deepak Agarwal & Srujana Merugu, Yahoo! Research

2006

Training linear SVMs in linear time

Thorsten Joachims, Cornell University

2005

Graphs over time: densification laws, shrinking diameters and possible explanations

Jure Leskovec, Carnegie Mellon University; et al.

2004

A probabilistic framework for semi-supervised clustering

Sugato Basu, University of Texas at Austin; et al.

2003

Maximizing the spread of influence through a social network

David Kempe, Cornell University; et al.

2002

Pattern discovery in sequences under a Markov assumption

Darya Chudova & Padhraic Smyth, University of California Irvine

2001

Robust space transformations for distance-based operations

Edwin M. Knorr, University of British Columbia; et al.

2000

Hancock: a language for extracting signatures from data streams

Corinna Cortes, AT&T Laboratories; et al.

1999

MetaCost: a general method for making classifiers cost-sensitive

Pedro Domingos, Universidade Técnica de Lisboa

1998

Occam's Two Razors: The Sharp and the Blunt

Pedro Domingos, Universidade Técnica de Lisboa

1997

Analysis and Visualization of Classifier Performance: Comparison under Imprecise Class and Cost Di...

Foster Provost & Tom Fawcett, NYNEX Science and Technology

SIGMOD (Databases)

2013

Massive Graph Triangulation

Xiaocheng Hu, The Chinese University of Hong Kong; et al.

2012

High-Performance Complex Event Processing over XML Streams

Barzan Mozafari, Massachusetts Institute of Technology; et al.

2011

Entangled Queries: Enabling Declarative Data-Driven Coordination

Nitin Gupta, Cornell University; et al.

2010

FAST: fast architecture sensitive tree search on modern CPUs and GPUs

Changkyu Kim, Intel; et al.

2009

Generating example data for dataflow programs

Christopher Olston, Yahoo! Research; et al.

2008

Serializable isolation for snapshot databases

Michael J. Cahill, University of Sydney; et al.

Scalable Network Distance Browsing in Spatial Databases

Hanan Samet, University of Maryland; et al.

2007

Compiling mappings to bridge applications and databases

Sergey Melnik, Microsoft Research; et al.

Scalable Approximate Query Processing with the DBO Engine

Christopher Jermaine, University of Florida; et al.

2006

To search or to crawl?: towards a query optimizer for text-centric tasks

Panagiotis G. Ipeirotis, New York University; et al.

2004

Indexing spatio-temporal trajectories with Chebyshev polynomials

Yuhan Cai & Raymond T. Ng, University of British Columbia

2003

Spreadsheets in RDBMS for OLAP

Andrew Witkowski, Oracle; et al.

2001

Locally adaptive dimensionality reduction for indexing large time series databases

Eamonn Keogh, University of California Irvine; et al.

2000

XMill: an efficient compressor for XML data

Hartmut Liefke, University of Pennsylvania
Dan Suciu, AT&T Laboratories

1999

DynaMat: a dynamic view management system for data warehouses

Yannis Kotidis & Nick Roussopoulos, University of Maryland

1998

Efficient transparent application recovery in client-server information systems

David Lomet & Gerhard Weikum, Microsoft Research

Integrating association rule mining with relational database systems: alternatives and implications

Sunita Sarawagi, IBM Research; et al.

1997

Fast parallel similarity search in multimedia databases

Stefan Berchtold, University of Munich; et al.

1996

Implementing data cubes efficiently

Venky Harinarayan, Stanford University; et al.

VLDB (Databases)

2013

DisC Diversity: Result Diversification based on Dissimilarity and Coverage

Marina Drosou & Evaggelia Pitoura, University of Ioannina

2012

Dense Subgraph Maintenance under Streaming Edge Weight Updates for Real-time Story Identification

Albert Angel, University of Toronto; et al.

2011

RemusDB: Transparent High-Availability for Database Systems

Umar Farooq Minhas, University of Waterloo; et al.

2010

Towards Certain Fixes with Editing Rules and Master Data

Shuai Ma, University of Edinburgh; et al.

2009

A Unified Approach to Ranking in Probabilistic Databases

Jian Li, University of Maryland; et al.

2008

Finding Frequent Items in Data Streams

Graham Cormode & Marios Hadjieleftheriou, AT&T Laboratories

Constrained Physical Design Tuning

Nicolas Bruno & Surajit Chaudhuri, Microsoft Research

2007

Scalable Semantic Web Data Management Using Vertical Partitioning

Daniel J. Abadi, Massachusetts Institute of Technology; et al.

2006

Trustworthy Keyword Search for Regulatory-Compliant Records Retention

Soumyadeb Mitra, University of Illinois at Urbana-Champaign; et al.

2005

Cache-conscious Frequent Pattern Mining on a Modern Processor

Amol Ghoting, Ohio State University; et al.

2004

Model-Driven Data Acquisition in Sensor Networks

Amol Deshpande, University of California Berkeley; et al.

2001

Weaving Relations for Cache Performance

Anastassia Ailamaki, Carnegie Mellon University; et al.

1997

Integrating Reliable Memory in Databases

Wee Teck Ng & Peter M. Chen, University of Michigan

ICML (Machine Learning)

2013

Vanishing Component Analysis

Roi Livni, The Hebrew University of Jerusalum; et al.

Fast Semidifferential-based Submodular Function Optimization

Rishabh Iyer, University of Washington; et al.

2012

Bayesian Posterior Sampling via Stochastic Gradient Fisher Scoring

Sungjin Ahn, University of California Irvine; et al.

2011

Computational Rationalization: The Inverse Equilibrium Problem

Kevin Waugh, Carnegie Mellon University; et al.

2010

Hilbert Space Embeddings of Hidden Markov Models

Le Song, Carnegie Mellon University; et al.

2009

Structure preserving embedding

Blake Shaw & Tony Jebara, Columbia University

2008

SVM Optimization: Inverse Dependence on Training Set Size

Shai Shalev-Shwartz & Nathan Srebro, Toyota Technological Institute at Chicago

2007

Information-theoretic metric learning

Jason V. Davis, University of Texas at Austin; et al.

2006

Trading convexity for scalability

Ronan Collobert, NEC Labs America; et al.

2005

A support vector method for multivariate performance measures

Thorsten Joachims, Cornell University

1999

Least-Squares Temporal Difference Learning

Justin A. Boyan, NASA Ames Research Center

SIGIR (Information Retrieval)

2013

Beliefs and Biases in Web Search

Ryen W. White, Microsoft Research

2012

Time-Based Calibration of Effectiveness Measures

Mark Smucker & Charles Clarke, University of Waterloo

2011

Find It If You Can: A Game for Modeling Different Types of Web Search Success Using Interaction Data

Mikhail Ageev, Moscow State University; et al.

2010

Assessing the Scenic Route: Measuring the Value of Search Trails in Web Logs

Ryen W. White, Microsoft Research
Jeff Huang, University of Washington

2009

Sources of evidence for vertical selection

Jaime Arguello, Carnegie Mellon University; et al.

2008

Algorithmic Mediation for Collaborative Exploratory Search

Jeremy Pickens, FX Palo Alto Lab; et al.

2007

Studying the Use of Popular Destinations to Enhance Web Search Interaction

Ryen W. White, Microsoft Research; et al.

2006

Minimal Test Collections for Retrieval Evaluation

Ben Carterette, University of Massachusetts Amherst; et al.

2005

Learning to estimate query difficulty: including applications to missing content detection and dis...

Elad Yom-Tov, IBM Research; et al.

2004

A Formal Study of Information Retrieval Heuristics

Hui Fang, University of Illinois at Urbana-Champaign; et al.

2003

Re-examining the potential effectiveness of interactive query expansion

Ian Ruthven, University of Strathclyde

2002

Novelty and redundancy detection in adaptive filtering

Yi Zhang, Carnegie Mellon University; et al.

2001

Temporal summaries of new topics

James Allan, University of Massachusetts Amherst; et al.

2000

IR evaluation methods for retrieving highly relevant documents

Kalervo J?rvelin & Jaana Kek?l?inen, University of Tampere

1999

Cross-language information retrieval based on parallel texts and automatic mining of parallel text...

Jian-Yun Nie, Université de Montréal; et al.

1998

A theory of term weighting based on exploratory data analysis

Warren R. Greiff, University of Massachusetts Amherst

1997

Feature selection, perceptron learning, and a usability case study for text categorization

Hwee Tou Ng, DSO National Laboratories; et al.

1996

Retrieving spoken documents by combining multiple index sources

Gareth Jones, University of Cambridge; et al.

推荐一个网站,感谢作者的努力搜集,主要是各种顶级会议的最佳论文集合。

http://jeffhuang.com/best_paper_awards.html

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Best Graphic Settings
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. How to Fix Audio if You Can't Hear Anyone
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
WWE 2K25: How To Unlock Everything In MyRise
4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

The first choice for CS players: recommended computer configuration The first choice for CS players: recommended computer configuration Jan 02, 2024 pm 04:26 PM

1. Processor When choosing a computer configuration, the processor is one of the most important components. For playing games like CS, the performance of the processor directly affects the smoothness and response speed of the game. It is recommended to choose Intel Core i5 or i7 series processors because they have powerful multi-core processing capabilities and high frequencies, and can easily cope with the high requirements of CS. 2. Graphics card Graphics card is one of the important factors in game performance. For shooting games such as CS, the performance of the graphics card directly affects the clarity and smoothness of the game screen. It is recommended to choose NVIDIA GeForce GTX series or AMD Radeon RX series graphics cards. They have excellent graphics processing capabilities and high frame rate output, and can provide a better gaming experience. 3. Memory power

What is the best Python library for hidden Markov models? What is the best Python library for hidden Markov models? Aug 30, 2023 pm 06:45 PM

Hidden Markov Models (HMMs) are a powerful type of statistical model used for modeling sequence data. They have uses in numerous fields including speech recognition, natural language processing, finance, and bioinformatics. Python is a versatile programming language that provides a series of libraries for implementing HMMs. In this article, we will discover unique Python libraries for HMMs and evaluate their functionality, performance and ease of use, sooner or later revealing the best option for your needs. Getting Started with Hidden Markov Models Before diving into these libraries, let’s briefly review the concept of HMMs. HMM is a probabilistic model that represents the transition of a system between hidden states over time. It consists of the following parts - a set of hidden states initial state probability distribution state transition

Why is it difficult to implement collection-like functions in Go language? Why is it difficult to implement collection-like functions in Go language? Mar 24, 2024 am 11:57 AM

It is difficult to implement collection-like functions in the Go language, which is a problem that troubles many developers. Compared with other programming languages ​​such as Python or Java, the Go language does not have built-in collection types, such as set, map, etc., which brings some challenges to developers when implementing collection functions. First, let's take a look at why it is difficult to implement collection-like functionality directly in the Go language. In the Go language, the most commonly used data structures are slice and map. They can complete collection-like functions, but

How to optimize Java collection sorting performance How to optimize Java collection sorting performance Jun 30, 2023 am 10:43 AM

Java is a powerful programming language that is widely used in various types of software development. In Java development, scenarios that often involve sorting collections are involved. However, if performance optimization is not performed for collection sorting, the execution efficiency of the program may decrease. This article will explore how to optimize the performance of Java collection sorting. 1. Choose the appropriate collection class In Java, there are many collection classes that can be used for sorting, such as ArrayList, LinkedList, TreeSet, etc. Different collection classes are in

How to use Go language for data mining? How to use Go language for data mining? Jun 10, 2023 am 08:39 AM

With the rise of big data and data mining, more and more programming languages ​​have begun to support data mining functions. As a fast, safe and efficient programming language, Go language can also be used for data mining. So, how to use Go language for data mining? Here are some important steps and techniques. Data Acquisition First, you need to obtain the data. This can be achieved through various means, such as crawling information on web pages, using APIs to obtain data, reading data from databases, etc. Go language comes with rich HTTP

Data Analysis with MySql: How to Handle Data Mining and Statistics Data Analysis with MySql: How to Handle Data Mining and Statistics Jun 16, 2023 am 11:43 AM

MySql is a popular relational database management system that is widely used in enterprise and personal data storage and management. In addition to storing and querying data, MySql also provides functions such as data analysis, data mining, and statistics that can help users better understand and utilize data. Data is a valuable asset in any business or organization, and data analysis can help companies make correct business decisions. MySql can perform data analysis and data mining in many ways. Here are some practical techniques and tools: Use

Time Series Forecasting Tips in Python Time Series Forecasting Tips in Python Jun 10, 2023 am 08:10 AM

With the advent of the data era, more and more data are collected and used for analysis and prediction. Time series data is a common data type that contains a series of data based on time. The methods used to forecast this type of data are called time series forecasting techniques. Python is a very popular programming language with strong data science and machine learning support, so it is also a very suitable tool for time series forecasting. This article will introduce some commonly used time series forecasting techniques in Python and provide some practical applications

Add all elements from one collection to another using the addAll() method of the HashSet class Add all elements from one collection to another using the addAll() method of the HashSet class Jul 24, 2023 am 08:58 AM

Use the addAll() method of the HashSet class to add all elements in a collection to another collection. HashSet is an implementation class in the Java collection framework. It inherits from AbstractSet and implements the Set interface. HashSet is an unordered set based on a hash table, which does not allow duplicate elements. It provides many commonly used methods to operate elements in the collection, one of which is the addAll() method. The function of the addAll() method is to add the specified

See all articles