IaaS Provider iland chooses Cassandra over MongoDB

WBOY
풀어 주다: 2016-06-07 16:35:46
원래의
1070명이 탐색했습니다.

iland chose Apache Cassandra over MongoDB because Cassandra provides constant time writes no matter how big the data set grows and for its distributed nature as well as its massive scalability, reliability, performance, availability, consi

iland
 

“iland chose Apache Cassandra over MongoDB because Cassandra provides constant time writes no matter how big the data set grows and for its distributed nature as well as its “massive” scalability, reliability, performance, availability, consistency and simplicity.”
 

-Julien Anguenot, ?Director of Software Engineering at iland

Julien Anguenot

Julien Anguenot ?Director of Software Engineering at iland

Follow @anguenot

iland
iland internet solutions, founded in 1995, is a pure IaaS player providing enterprise cloud infrastructure and services with several datacenters in North America, Europe and Asia.

iland platform with Cassandra

Cassandra is the sole database leveraged by the land platform, distributed across iland’s datacenters, which is the foundation of the customer-facing ECS2 iland portal.

The platform stores time series: real-time (20 seconds samples coming from vSphere) and historical rollups (1m, 1h, 1d, 1w and 1month) for dozens of virtual machine’s performance counters, corresponding resource pools and networks.

Also, Cassandra stores usage corresponding real-time and historical billing information as well as infrastructure configuration, user information etc. The platform also provides predictive analytics that help companies monitor performance, achieve consistency and anticipate growth requirements.

The iland portal is essentially an easy to use and understand front end (web and mobile) for the iland platform solutions – it covers a wealth of functionality including offering visibility into resource consumption, billing, performance, the impact of change and other key areas. It also provides usage and billing based alerts as well as cloud management features.

Evaluating MongoDB and Cassandra

iland chose Apache Cassandra over MongoDB because Cassandra provides constant time writes no matter how big the data set grows and for its distributed nature as well as its “massive” scalability, reliability, performance, availability, consistency and simplicity.

Constant-time writes no matter how big the data is a must for our real-time performance counters collection since the amount of virtual machines to collect from will increase to ten of thousands along with workers concurrently performing operations at the application level.

Multi-datacenter deployment

We use a Cassandra 2.0.x cluster distributed across 5 datacenters (Los Angeles, CA – Reston, VA – London, UK – Manchester, UK – Singapore)

iland uses the 20x Debian deb packages hosted by the Apache foundation on Ubuntu 12.04 LTS.

We use CQL3 over thrift at the moment, using Astyanax, but we are planning to switch to the DataStax CQL Java driver when Astyanax 2.0 will be released.

Each datacenter has at least one rack of 3 nodes and all data is replicated across all nodes in the cluster.

To date: total cluster nodes is 18 and and we are getting close to 1TB of data (application has been deployed empty in September 2013 with the iland ECS2 brand new offering with no legacy data to migrate over to Cassandra)

?

(RF = replication factor)

?

RF = 3, W – LOCAL_QUORUM (2 nodes), R – LOCAL_QUORUM (2 nodes)

?

This configuration allows for a single node to fail while still serving both READS and WRITES. This setup comes at a cost of having a larger data footprint in terms of storage size. This allows us to use Cassandra’s tunable consistency to our advantage and ensure that all reads are consistent, yet keeping our availability as high as possible when running on 3 nodes.

?

Cassandra nodes are running off Ubuntu powered virtual machines (in vSphere). Each node has 16GB of RAM and 8 vCPUs

?

Getting started

My advice would be to start w/ a single-node instance to avoid clustering related concerns initially and use CQL3 (vs thrift) from the start.

?

Documentation is great, issue tracker and mailing list are great source of information, upgrade and maintenance of Cassandra are painless and drivers such as the DataStax CQL drivers for Python or Java as well as Netflix’s Astyanax have been working just great for us.

관련 라벨:
원천:php.cn
본 웹사이트의 성명
본 글의 내용은 네티즌들의 자발적인 기여로 작성되었으며, 저작권은 원저작자에게 있습니다. 본 사이트는 이에 상응하는 법적 책임을 지지 않습니다. 표절이나 침해가 의심되는 콘텐츠를 발견한 경우 admin@php.cn으로 문의하세요.
인기 튜토리얼
더>
최신 다운로드
더>
웹 효과
웹사이트 소스 코드
웹사이트 자료
프론트엔드 템플릿