IaaS Provider iland chooses Cassandra over MongoDB
iland chose Apache Cassandra over MongoDB because Cassandra provides constant time writes no matter how big the data set grows and for its distributed nature as well as its massive scalability, reliability, performance, availability, consi

-Julien Anguenot, ?Director of Software Engineering at iland
Julien Anguenot ?Director of Software Engineering at iland
Follow @anguenot
iland
iland internet solutions, founded in 1995, is a pure IaaS player providing enterprise cloud infrastructure and services with several datacenters in North America, Europe and Asia.
iland platform with Cassandra
Cassandra is the sole database leveraged by the land platform, distributed across iland’s datacenters, which is the foundation of the customer-facing ECS2 iland portal.
The platform stores time series: real-time (20 seconds samples coming from vSphere) and historical rollups (1m, 1h, 1d, 1w and 1month) for dozens of virtual machine’s performance counters, corresponding resource pools and networks.
Also, Cassandra stores usage corresponding real-time and historical billing information as well as infrastructure configuration, user information etc. The platform also provides predictive analytics that help companies monitor performance, achieve consistency and anticipate growth requirements.
The iland portal is essentially an easy to use and understand front end (web and mobile) for the iland platform solutions – it covers a wealth of functionality including offering visibility into resource consumption, billing, performance, the impact of change and other key areas. It also provides usage and billing based alerts as well as cloud management features.
Evaluating MongoDB and Cassandra
iland chose Apache Cassandra over MongoDB because Cassandra provides constant time writes no matter how big the data set grows and for its distributed nature as well as its “massive” scalability, reliability, performance, availability, consistency and simplicity.
Constant-time writes no matter how big the data is a must for our real-time performance counters collection since the amount of virtual machines to collect from will increase to ten of thousands along with workers concurrently performing operations at the application level.
Multi-datacenter deployment
We use a Cassandra 2.0.x cluster distributed across 5 datacenters (Los Angeles, CA – Reston, VA – London, UK – Manchester, UK – Singapore)
iland uses the 20x Debian deb packages hosted by the Apache foundation on Ubuntu 12.04 LTS.
We use CQL3 over thrift at the moment, using Astyanax, but we are planning to switch to the DataStax CQL Java driver when Astyanax 2.0 will be released.
Each datacenter has at least one rack of 3 nodes and all data is replicated across all nodes in the cluster.
To date: total cluster nodes is 18 and and we are getting close to 1TB of data (application has been deployed empty in September 2013 with the iland ECS2 brand new offering with no legacy data to migrate over to Cassandra)
?
(RF = replication factor)
?
RF = 3, W – LOCAL_QUORUM (2 nodes), R – LOCAL_QUORUM (2 nodes)
?
This configuration allows for a single node to fail while still serving both READS and WRITES. This setup comes at a cost of having a larger data footprint in terms of storage size. This allows us to use Cassandra’s tunable consistency to our advantage and ensure that all reads are consistent, yet keeping our availability as high as possible when running on 3 nodes.
?
Cassandra nodes are running off Ubuntu powered virtual machines (in vSphere). Each node has 16GB of RAM and 8 vCPUs
?
Getting started
My advice would be to start w/ a single-node instance to avoid clustering related concerns initially and use CQL3 (vs thrift) from the start.
?
Documentation is great, issue tracker and mailing list are great source of information, upgrade and maintenance of Cassandra are painless and drivers such as the DataStax CQL drivers for Python or Java as well as Netflix’s Astyanax have been working just great for us.
原文地址:IaaS Provider iland chooses Cassandra over MongoDB, 感谢原作者分享。

熱AI工具

Undresser.AI Undress
人工智慧驅動的應用程序,用於創建逼真的裸體照片

AI Clothes Remover
用於從照片中去除衣服的線上人工智慧工具。

Undress AI Tool
免費脫衣圖片

Clothoff.io
AI脫衣器

Video Face Swap
使用我們完全免費的人工智慧換臉工具,輕鬆在任何影片中換臉!

熱門文章

熱工具

記事本++7.3.1
好用且免費的程式碼編輯器

SublimeText3漢化版
中文版,非常好用

禪工作室 13.0.1
強大的PHP整合開發環境

Dreamweaver CS6
視覺化網頁開發工具

SublimeText3 Mac版
神級程式碼編輯軟體(SublimeText3)

全表掃描在MySQL中可能比使用索引更快,具體情況包括:1)數據量較小時;2)查詢返回大量數據時;3)索引列不具備高選擇性時;4)複雜查詢時。通過分析查詢計劃、優化索引、避免過度索引和定期維護表,可以在實際應用中做出最優選擇。

是的,可以在 Windows 7 上安裝 MySQL,雖然微軟已停止支持 Windows 7,但 MySQL 仍兼容它。不過,安裝過程中需要注意以下幾點:下載適用於 Windows 的 MySQL 安裝程序。選擇合適的 MySQL 版本(社區版或企業版)。安裝過程中選擇適當的安裝目錄和字符集。設置 root 用戶密碼,並妥善保管。連接數據庫進行測試。注意 Windows 7 上的兼容性問題和安全性問題,建議升級到受支持的操作系統。

InnoDB的全文搜索功能非常强大,能够显著提高数据库查询效率和处理大量文本数据的能力。1)InnoDB通过倒排索引实现全文搜索,支持基本和高级搜索查询。2)使用MATCH和AGAINST关键字进行搜索,支持布尔模式和短语搜索。3)优化方法包括使用分词技术、定期重建索引和调整缓存大小,以提升性能和准确性。

聚集索引和非聚集索引的區別在於:1.聚集索引將數據行存儲在索引結構中,適合按主鍵查詢和範圍查詢。 2.非聚集索引存儲索引鍵值和數據行的指針,適用於非主鍵列查詢。

MySQL是一個開源的關係型數據庫管理系統。 1)創建數據庫和表:使用CREATEDATABASE和CREATETABLE命令。 2)基本操作:INSERT、UPDATE、DELETE和SELECT。 3)高級操作:JOIN、子查詢和事務處理。 4)調試技巧:檢查語法、數據類型和權限。 5)優化建議:使用索引、避免SELECT*和使用事務。

MySQL 和 MariaDB 可以共存,但需要謹慎配置。關鍵在於為每個數據庫分配不同的端口號和數據目錄,並調整內存分配和緩存大小等參數。連接池、應用程序配置和版本差異也需要考慮,需要仔細測試和規劃以避免陷阱。在資源有限的情況下,同時運行兩個數據庫可能會導致性能問題。

數據集成簡化:AmazonRDSMySQL與Redshift的零ETL集成高效的數據集成是數據驅動型組織的核心。傳統的ETL(提取、轉換、加載)流程複雜且耗時,尤其是在將數據庫(例如AmazonRDSMySQL)與數據倉庫(例如Redshift)集成時。然而,AWS提供的零ETL集成方案徹底改變了這一現狀,為從RDSMySQL到Redshift的數據遷移提供了簡化、近乎實時的解決方案。本文將深入探討RDSMySQL零ETL與Redshift集成,闡述其工作原理以及為數據工程師和開發者帶來的優勢。

MySQL 數據庫中,用戶和數據庫的關係通過權限和表定義。用戶擁有用戶名和密碼,用於訪問數據庫。權限通過 GRANT 命令授予,而表由 CREATE TABLE 命令創建。要建立用戶和數據庫之間的關係,需創建數據庫、創建用戶,然後授予權限。
