HBase intra row scanning

Jun 07, 2016, 04:26 PM

By Lars Hofhansl

Updated (again) Wednesday, January 25th, 2012.

As I painfully worked through HBASE-5229 I realized that HBase already has all the building blocks needed for complex (local) transactions.

What's important here is that (see my introduction to HBase):

HBase ensures atomicity for operations on the same row key
HBase keys have internal structure: (row-key, column family, column, ...)

The missing piece was ColumnRangeFilter. With this filter it is possible to retrieve all columns whose identifier starts with "abc", or all columns whose identifier sorts > "test". For example:

// all columns whose identifier starts with "abc"
Filter prefixFilter = new ColumnRangeFilter(Bytes.toBytes("abc"), true,
                                            Bytes.toBytes("abd"), false);

// all columns whose identifier sorts at or after "test"
// (minInclusive is true; a null maxColumn means no upper bound)
Filter tailFilter = new ColumnRangeFilter(Bytes.toBytes("test"), true,
                                          null, true);

This makes it possible to search (scan) inside a row by column identifier, just as HBase allows searching by row key.
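
The "abc" to "abd" pair above is a prefix turned into a half-open range: the upper bound is the prefix with its last byte incremented. A minimal sketch of that computation in plain Java follows. The class and method names are hypothetical and are not part of HBase, and it assumes column identifiers compare as unsigned bytes in lexicographic order.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

class PrefixRange {
    /**
     * Returns the smallest byte array that sorts after every array starting
     * with the given prefix, or null when no such bound exists (empty prefix
     * or all bytes 0xFF), which a caller can treat as "no upper limit".
     */
    static byte[] exclusiveUpperBound(byte[] prefix) {
        // Trailing 0xFF bytes cannot be incremented, so drop them first.
        int end = prefix.length;
        while (end > 0 && prefix[end - 1] == (byte) 0xFF) {
            end--;
        }
        if (end == 0) {
            return null;
        }
        byte[] bound = Arrays.copyOf(prefix, end);
        bound[end - 1]++; // e.g. 'c' becomes 'd'
        return bound;
    }

    public static void main(String[] args) {
        byte[] bound = exclusiveUpperBound("abc".getBytes(StandardCharsets.UTF_8));
        System.out.println(new String(bound, StandardCharsets.UTF_8)); // prints "abd"
    }
}
```

The result would be passed as the maximum column with the inclusive flag set to false, as in the first filter above.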

A client application can exploit this to achieve transactions by grouping all entities that can participate in the same transaction into a single row (and single column family).
Prefixes of the column identifiers can then be used to define rows inside that group. Basically, the search criteria for keys are moved one level down to the column identifier.

Say we wanted to implement a store with transactional tables that contain rows and columns. One way of doing this with HBase is as follows:

the HBase row-key/column-family maps to a "table"
a prefix of the HBase column identifier maps to a "row"
the rest of the HBase column identifier identifies the "column"

This is in fact similar to what Google's Megastore (pdf) does.
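
The mapping above can be sketched as a small codec that packs a logical row name and a column name into one HBase column identifier. This is only an illustration: the class name, the method names, and the '|' separator are assumptions made for the example, and the separator must never occur in a logical row name for decoding to be unambiguous.

```java
import java.nio.charset.StandardCharsets;

class QualifierCodec {
    // Assumption: '|' never appears in a logical row name.
    static final char SEPARATOR = '|';

    /** Builds the HBase column identifier for one logical (row, column) pair. */
    static byte[] encode(String logicalRow, String column) {
        if (logicalRow.indexOf(SEPARATOR) >= 0) {
            throw new IllegalArgumentException("row name contains separator: " + logicalRow);
        }
        return (logicalRow + SEPARATOR + column).getBytes(StandardCharsets.UTF_8);
    }

    /** Splits a column identifier back into { logicalRow, column }. */
    static String[] decode(byte[] qualifier) {
        String s = new String(qualifier, StandardCharsets.UTF_8);
        int i = s.indexOf(SEPARATOR);
        if (i < 0) {
            throw new IllegalArgumentException("no separator in qualifier: " + s);
        }
        return new String[] { s.substring(0, i), s.substring(i + 1) };
    }

    public static void main(String[] args) {
        byte[] q = encode("rowA", "column1");
        String[] parts = decode(q);
        System.out.println(parts[0] + " / " + parts[1]); // prints "rowA / column1"
    }
}
```

Ending the prefix with an explicit separator also keeps a range over "rowA|" from matching a different logical row such as "rowAB".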

This leads to potentially wide HBase rows with many columns. The missing piece is allowing a Scan to efficiently retrieve a slice of a wide row.

This is where ColumnRangeFilter comes into play. This filter retrieves the slice efficiently by seeking ahead to the first HBase block that contains the first KeyValue (or cell) for that column.
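
As a rough in-memory analogy (not HBase code), a sorted map can stand in for the columns of one wide row, and a range view over it plays the role of the column range. The sketch below uses a `TreeMap` with `String` keys; the class name and sample data are hypothetical, and it says nothing about how HBase lays out blocks or performs the seek.

```java
import java.util.NavigableMap;
import java.util.TreeMap;

class WideRowSlice {
    /**
     * Returns the columns between min and max, mirroring the four arguments
     * of the filter. A null bound means "unbounded" on that side.
     */
    static NavigableMap<String, String> slice(NavigableMap<String, String> row,
                                              String min, boolean minInclusive,
                                              String max, boolean maxInclusive) {
        if (min == null && max == null) {
            return row;
        }
        if (min == null) {
            return row.headMap(max, maxInclusive);
        }
        if (max == null) {
            return row.tailMap(min, minInclusive);
        }
        return row.subMap(min, minInclusive, max, maxInclusive);
    }

    public static void main(String[] args) {
        // One wide "HBase row": sorted column identifier -> value.
        NavigableMap<String, String> row = new TreeMap<>();
        row.put("rowA|column1", "a1");
        row.put("rowB|column1", "b1");
        row.put("rowB|column2", "b2");
        row.put("rowC|column1", "c1");

        // Everything whose identifier starts with "rowB".
        System.out.println(slice(row, "rowB", true, "rowC", false));
        // prints {rowB|column1=b1, rowB|column2=b2}
    }
}
```

The sorted map locates the start of the range without visiting the columns before it, which is the property the filter relies on for wide rows.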

Let's model a table "pets" this way. And let's say a pet has a name and a species. The HBase key for entries would look like this:
(table, CF1, rowA|column1) -> value for column1 in rowA
The code would look something like this:
(apologies for the initial incorrect code that I had posted here)

HTable t = ...;
Scan s = ...;
// identical start and stop rows limit the scan to the single HBase row "pets"
s.setStartRow(Bytes.toBytes("pets"));
s.setStopRow(Bytes.toBytes("pets"));
// get all columns for my pet "fluffy"
Filter f = new ColumnRangeFilter(Bytes.toBytes("fluffy"), true,
                                 Bytes.toBytes("fluffz"), false);
s.setFilter(f);
s.setBatch(20); // avoid getting all columns for the HBase row at once
ResultScanner rs = t.getScanner(s);
try {
  for (Result r = rs.next(); r != null; r = rs.next()) {
    // r holds (up to 20 of) the HBase columns that start with "fluffy",
    // which together represent a single logical row
    for (KeyValue kv : r.raw()) {
      // each kv represents the latest version of a column
    }
  }
} finally {
  rs.close(); // always release the scanner
}
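
Because setBatch caps the number of cells per Result, a logical row with many columns can arrive spread over several Result objects. The sketch below shows one way a client might stitch such batches back together. It is plain Java with no HBase dependency: each `Map` stands in for one Result, the class and method names are hypothetical, and the '|' separator between logical row and column is an assumption of the example.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class BatchAssembler {
    /**
     * Merges batches of (column identifier -> value) cells into
     * logical row -> (column -> value).
     */
    static Map<String, Map<String, String>> assemble(List<Map<String, String>> batches) {
        Map<String, Map<String, String>> rows = new TreeMap<>();
        for (Map<String, String> batch : batches) {
            for (Map.Entry<String, String> cell : batch.entrySet()) {
                String qualifier = cell.getKey();
                int sep = qualifier.indexOf('|');
                if (sep < 0) {
                    throw new IllegalArgumentException("no separator in: " + qualifier);
                }
                String logicalRow = qualifier.substring(0, sep);
                String column = qualifier.substring(sep + 1);
                rows.computeIfAbsent(logicalRow, k -> new TreeMap<>())
                    .put(column, cell.getValue());
            }
        }
        return rows;
    }

    public static void main(String[] args) {
        // Two batches, as if the scanner had returned two Result objects.
        List<Map<String, String>> batches = Arrays.asList(
            Collections.singletonMap("fluffy|name", "Fluffy"),
            Collections.singletonMap("fluffy|species", "cat"));
        System.out.println(assemble(batches));
        // prints {fluffy={name=Fluffy, species=cat}}
    }
}
```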

The downside of this is that HBase achieves atomicity by collocating all cells with the same row-key, so the entire row, and with it every entity grouped into that row, has to be hosted by a single region server.

Original source: HBase intra row scanning. Thanks to the original author for sharing.
