Data story

MongoDB 2.6 released

MongoDB 2.6 has been released. New major features were the primary goal, but the release also improves performance.

Performance improvements:

  • efficient use of network resources
  • oplog processing is 75% faster
  • performance of several classes of scan, sort, $in and $all operations is significantly improved
  • bulk operators for writes improve updates by as much as 5x.

Feature improvements:

  • Text Search Integration
  • Insert and Update Improvements
  • A new write protocol integrates write operations with write concerns (the protocol also provides improved support for bulk operations)
  • A new authorization model that provides the ability to create custom User-Defined Roles and the ability to specify user privileges at a collection-level granularity.

Full release notes

PostgreSQL introduced jsonb support

Binary JSON

PostgreSQL has introduced jsonb, a diamond in the crown of PostgreSQL 9.4. It is based on an elegant hash opclass for GIN, which competes with MongoDB’s performance on the contains operator.

Feature’s documentation: http://www.postgresql.org/docs/devel/static/datatype-json.html

Feature’s story: http://obartunov.livejournal.com/177247.html
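
To make the "contains" operator more concrete, here is a minimal Python sketch of jsonb containment as described in the linked documentation. It is a simplified illustration, not PostgreSQL's implementation: the function names are hypothetical, JSON values are modelled as plain Python objects, and GIN indexing is not modelled at all.

```python
def _scalar_equal(left, right):
    """Compare two JSON scalars; JSON true/false must not match the numbers 1/0."""
    if isinstance(left, bool) or isinstance(right, bool):
        return isinstance(left, bool) and isinstance(right, bool) and left == right
    return left == right


def _deep_contains(left, right):
    """Strict structural containment: both sides must have the same JSON kind."""
    if isinstance(left, dict) or isinstance(right, dict):
        if not (isinstance(left, dict) and isinstance(right, dict)):
            return False
        # Every key/value pair on the right must be contained on the left.
        return all(
            key in left and _deep_contains(left[key], value)
            for key, value in right.items()
        )
    if isinstance(left, list) or isinstance(right, list):
        if not (isinstance(left, list) and isinstance(right, list)):
            return False
        # Order and duplicates are ignored: each wanted element needs some match.
        return all(
            any(_deep_contains(item, wanted) for item in left)
            for wanted in right
        )
    return _scalar_equal(left, right)


def jsonb_contains(left, right):
    """Approximate `left @> right` for values already parsed from JSON."""
    # Special case: a top-level array may contain a bare primitive value.
    if isinstance(left, list) and not isinstance(right, (dict, list)):
        return any(_scalar_equal(item, right) for item in left)
    return _deep_contains(left, right)


if __name__ == "__main__":
    doc = {"product": "PostgreSQL", "version": 9.4, "jsonb": True}
    print(jsonb_contains(doc, {"version": 9.4}))
    print(jsonb_contains({"foo": {"bar": "baz"}}, {"bar": "baz"}))
```

Containment is structural: the right-hand value has to match at the same nesting level, which is why the second call above does not match.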

hbase-0.98.0 has been released

This release includes:

  • several new security features like cell visibility labels, cell ACLs, and transparent server side encryption.
  • significant performance improvements, such as a change to the write ahead log threading model that provides higher transaction throughput under high load, reverse scanners, MapReduce over snapshot files, and striped compaction

The complete list of changes in this release can be found in the release notes: http://goo.gl/y25W2h

What do you know about SQL performance?

The 3-Minute Test: What do you know about SQL performance?

“SQL-Tuning is black magic like alchemy: it consists of obscure rules, understood only by a handful of insiders.”

That is a myth. SQL databases use well-known algorithms to deliver predictable performance. It is, however, easy to write SQL queries that cannot use the most efficient algorithm and thus deliver unexpected performance.

http://use-the-index-luke.com/3-minute-test

Parallel programming

[Figure: parallel programming]

VoltDB 4.0

VoltDB 4.0 is now available!

The highlights of VoltDB v4.0 include:

  • Enhanced in-memory analytics capabilities with a host of new SQL support.
  • Greatly improved analytic read throughput performance.
  • Clusters can grow elastically, increasing both throughput and capacity, by adding nodes to running clusters without blocking ongoing operations.
  • Support for Groovy stored procedures, a message queue export connector, a MySQL migration utility and a host of other features.
  • Online training, free, at Volt University, along with Volt Vanguard Certification.

Here are the details on what’s new in VoltDB v4.0. You can download it here.

Official announcement from the VoltDB blog: http://voltdb.com/announcing-voltdb-4-0-enhanced-in-memory-analytics-and-online-elasticity/

Enhanced In-Memory Analytics

VoltDB is renowned for its ability to execute very fast writes – we’ve benchmarked writes into the millions of transactions per second range, on small clusters, running on bare metal as well as cloud instances.

But fast writes without fast reads are less useful. Since its first version, VoltDB has allowed for transactional reads to support writes as well as provide a window into fast changing data.

At ingestion or processing time, stored procedures can transactionally perform lookups and queries as data is coming into the system, allowing for richer writes at scale. Separately, global transactional reads can trigger events, support dashboards and even live decisioning on immediate data. Mixing complex reads and writes transactionally, and at scale, has traditionally separated VoltDB from other write-heavy systems.

In 4.0, VoltDB has added both features and improved performance of analytic-focused read queries. We’re focused on helping users understand their data as soon as they have it.

First, VoltDB delivers major new SQL capabilities, now supporting SQL UNION, self/outer/explicit JOIN, CASE, HAVING, SQL IN, Group-by column functions and materialized view group-by column functions. Our SQL support is approaching SQL-92 compatibility, while also adding non-standard features to support our key use cases. For example, VoltDB can now build a materialized view that aggregates the value of a JSON field by 5-minute time windows.
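
To illustrate what a view aggregating a JSON field by 5-minute time windows computes, here is a minimal Python sketch. It is not VoltDB DDL or VoltDB's implementation: the function names, the `(timestamp, json_text)` row layout and the `amount` field are assumptions made for this example.

```python
import json
from collections import defaultdict

WINDOW_SECONDS = 5 * 60  # width of one aggregation window


def window_start(epoch_seconds):
    """Round a Unix timestamp down to the start of its 5-minute window."""
    return epoch_seconds - (epoch_seconds % WINDOW_SECONDS)


def aggregate_by_window(rows, field):
    """Group (timestamp, json_text) rows by window; return {start: (count, sum)}."""
    totals = defaultdict(lambda: [0, 0.0])  # window start -> [count, sum]
    for timestamp, json_text in rows:
        value = json.loads(json_text)[field]
        bucket = totals[window_start(timestamp)]
        bucket[0] += 1
        bucket[1] += value
    return {start: (count, total) for start, (count, total) in sorted(totals.items())}


if __name__ == "__main__":
    rows = [
        (0, '{"amount": 2}'),
        (299, '{"amount": 3}'),   # still inside the first window
        (300, '{"amount": 5}'),   # first second of the next window
    ]
    for start, (count, total) in aggregate_by_window(rows, "amount").items():
        print(start, count, total)
```

A materialized view maintains these per-window totals incrementally as rows arrive, whereas this sketch recomputes them from scratch on each call.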

Second, we’ve removed some of the transactional overhead when running many kinds of global read queries, including ad-hoc SQL. These queries are still reading a live, fully serializable view of committed data, but they’re now up to 50x faster. This directly translates into more powerful dashboards, more consumers of analytics and richer decisioning.

Online Elastic Database

As a natively clustered database, VoltDB can scale to meet the needs of almost any high-velocity application. While some users have megabytes of state, others have terabytes. While some users process hundreds of operations per second, others process millions. But what if your business is growing and you want your VoltDB cluster to grow with it? Since we shipped VoltDB 1.0, users have been asking to add nodes to the cluster without any interruption in service, to rebalance data in the background while their apps continue to work for them.

We call this feature “elasticity” and it’s shipping in VoltDB 4.0. VoltDB can now seamlessly add nodes to a running cluster, increasing storage and throughput with each new node. Since all topology changes and data movement are transactional and durable, your data is protected while the cluster is expanding or rebalancing. We’ve also carefully engineered this feature so that most customers will see no impact to their workload during expansion.

Online Operations

In addition to the new online elasticity, VoltDB also supports online catalog and schema updates.  Tables can be added or dropped. Columns, indexes and materialized views can be added, dropped and modified.  And of course, stored procedures can be added, updated, or deleted.

VoltDB upgraded its network partition and fault detection handling. In the real world, network partitions happen for a variety of reasons. Both hardware and software issues can cause a node to become unavailable, or unreachable to other nodes in a cluster. For version 4.0, VoltDB enhanced its fault detection and recovery functionality to remain available in the face of even more kinds of failure. The result is increased availability of VoltDB clusters where links might break, for example, when VoltDB nodes are running in different availability zones.

Groovy – Our First Non-Java Stored Procedure Language

In VoltDB 4.0 we’ve added the ability to code the procedure implementation in the DDL itself with inline Groovy scripts. Check out our Groovy Voter sample for a familiar example of VoltDB processing in Groovy.  We hope this is the first of many new stored procedure languages – please drop us a note if you have a favorite, or would like to contribute to this effort.

Integrations, Migrations and Miscellaneous Features

VoltDB v4.0 introduces a whole host of other features. I’ll run through them here quickly:

  • VoltDB Export allows you to transactionally push data from VoltDB into another system, similar to an ETL (extract, transform, load) process. In 3.x, we could export to systems using JDBC, as well as to flat files. VoltDB 4.0 adds a new Export connector to leverage message queues. Presently available as a Beta, you can now export to a message queue using Kafka. Feel free to contact us if you’d like early access.
  • We’ve added three new @Statistics selectors that can help identify performance hot spots in your application.  They are PROCEDUREPROFILE, identifying the percentage of execution time each procedure takes, PROCEDUREINPUT, identifying the breakdown of data flow into stored procedures, and PROCEDUREOUTPUT, identifying the result set data flow from stored procedures.  These new statistics help you to quickly identify what transactions are taking the most time or I/O in your application.
  • You can now get a leg up porting your MySQL database to VoltDB. Our new utility, fondly called Voltify, will extract your database schema from your MySQL database and create a VoltDB catalog automatically. This utility, coupled with our high performance CSV loader, enables you to rapidly move your MySQL database to VoltDB.
  • The VoltDB JDBC driver has been enhanced to support parameterized ad hoc SQL statements, setting query timeouts, as well as additional metadata methods.

VoltDB Training

VoltDB recently rolled out Volt University, a new offering of free online training on VoltDB key concepts. There are ten lessons, found at http://voltdb.com/resources/volt-university/tutorials/, that can help you come up to VoltDB speed quickly. Additionally, should you wish to obtain formal Volt Vanguard certification, we are offering an official online certification course, which you can register for here: https://university.voltdb.com/

Download VoltDB 4.0

We here at VoltDB are very excited about the release and hope you are too.  You can download VoltDB 4.0 here.

 

Couchbase Server 2.5 released

Couchbase Server 2.5 has been released. It includes the following new features and enhancements:

  • Rack Awareness (Enterprise Edition only)
  • XDCR data security (Enterprise Edition only)
  • Optimized connection management

 

High Availability with Rack Awareness

To ensure enterprise-class availability and reliability, master data and replicated data should be stored on different server racks. Couchbase Server 2.5 Enterprise Edition’s newly introduced Rack Awareness provides a simple, flexible and effective solution for data replication that is easy to scale and administer. With Couchbase Rack Awareness, the user can create logical groupings of Couchbase Server nodes and replica copies of the data are automatically distributed across server nodes located on different racks. This intelligent data replication ensures that data is secure despite disruptions such as power outages, or switch or rack failure.

Rack Awareness is especially needed for applications running on public clouds, such as Amazon EC2, where customers have no control over infrastructure availability and uptime. With Couchbase Server Enterprise Edition 2.5, customers running applications on a public cloud can leverage Rack Awareness to ensure that replica data is stored in separate zones to maintain 24/7 application uptime.
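
The idea of keeping replica copies outside the rack (or zone) that holds the active copy can be sketched in a few lines of Python. This is only an illustration of the principle, not Couchbase's placement algorithm: the function name, the round-robin choice of the active node and the node and group labels are all assumptions.

```python
def place_replicas(node_groups, num_partitions, num_replicas=1):
    """Assign each partition an active node plus replicas in other groups.

    node_groups maps a node name to its logical group (rack or zone).
    Returns {partition: (active_node, [replica_nodes])}.
    """
    nodes = sorted(node_groups)
    placement = {}
    for partition in range(num_partitions):
        active = nodes[partition % len(nodes)]  # simple round-robin choice
        used_groups = {node_groups[active]}
        replicas = []
        # Walk the remaining nodes and keep only those in a group not used yet.
        for offset in range(1, len(nodes)):
            if len(replicas) == num_replicas:
                break
            candidate = nodes[(partition + offset) % len(nodes)]
            if node_groups[candidate] not in used_groups:
                replicas.append(candidate)
                used_groups.add(node_groups[candidate])
        if len(replicas) < num_replicas:
            raise ValueError("not enough distinct groups for the requested replicas")
        placement[partition] = (active, replicas)
    return placement


if __name__ == "__main__":
    groups = {"n1": "rackA", "n2": "rackA", "n3": "rackB", "n4": "rackB"}
    for partition, (active, replicas) in place_replicas(groups, 4).items():
        print(partition, active, replicas)
```

With two groups and one replica, losing a whole rack still leaves one copy of every partition in the other rack.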

 

Download http://www.couchbase.com/download

Hypertable version 0.9.7.16 released

Hypertable version 0.9.7.16 has been released. It brings the following changes:

  • Upgraded to C++11 compiler
  • issue 1179: Fixed insert perf problem introduced by bad commit in 0.9.7.13
  • issue 1193: Fixed split_row/end_row comparison in Range::estimate_split_row()
  • Fixed memory leak in index table mutator
  • Avoid aggressive merging during low memory mode
  • Fixed BalancePlanAuthority::change_receiver_plan_location() to properly increment generation
  • issue 1191: Fixed DEB and RPM package installation
  • Fixed alloc-dealloc-mismatch error in hypertable_ldi_select_test
  • Fixed HQL-delete test
  • Fixed Spirit parser issues
  • issue 1104: Fixed intermittent failure of issue190 test
  • issue 1123: fixed ldd.sh script
  • Got rid of INFO log message in OperationRecover::decode_state()
  • issue 1193: Replaced assert with instrumentation logging
  • issue 1189: Propagate exceptions from ~TableMutator()
  • Modified issue890 test to compile java file into build directory
  • Modified Filesystem::readdir to return vector of Dirent structures
  • Reverted “merging compactions ahead of minor compactions” commit
  • Allow arbitrary column selection for secondary index queries
  • issue 1032: Fixed COUNTER columns “wrap around” on underflow problem
  • Added NO_CACHE option to SELECT statement

 

Download http://hypertable.com/download/09716

Release notes: http://cdn.hypertable.com/packages/0.9.7.16/CHANGES

DeepDB

DeepDB provides simultaneous transactions and analytics (row store and column store) in the same data set, in real time. Official website: http://deep.is/

It claims to be fully transactional (ACID compliant) and to introduce breakthroughs in six fundamental attributes of performance:

 

Constant-Time Indexing

  • Minimizes indexing cost, enabling highly indexed databases
  • Updates indexes in real-time, in-memory and on disk
  • Uses summary indexing to achieve yottabyte scale

Segmented Column Store

  • Adds columnar attributes to table-oriented indexes
  • Embeds meta-data including statistical aggregation
  • Allows for delta updates instead of a full column rebuild

Streaming I/O

  • Massively optimized, enabling wire-speed throughput
  • Concurrent operations for updates in-memory and on-disk
  • Optimizations for SSD, HDD, and in-memory-only operation

Intelligent Caching

  • Eliminates on-disk tree traversals
  • Adaptive segment sizes (no fixed pages)
  • Point-read capable, retrieves only what is necessary

Adaptive Concurrency

  • Minimizes delays and wait states to maximize CPU throughput
  • Fine-grained lightweight locking mechanisms
  • Eliminates most OS context switches

Designed for the Cloud

  • Continually optimizing system
  • Eliminates downtime for scheduled maintenance
  • Zero-touch adaptive configuration

 

About FoundationDB 2.0

FoundationDB 2.0 combines the power of ACID transactions with the scalability, fault tolerance, and operational elegance of distributed NoSQL databases. This release was driven by specific customer feedback for increased language support, network security, and higher-level tools for managing data within FoundationDB.

FoundationDB 2.0 adds Go and PHP to the list of languages with native FoundationDB support.

Along with the additional language and layer support, 2.0 also ships with full Transport Layer Security which encrypts all FoundationDB network traffic, enabling security and authentication between both servers and clients via a public/private key infrastructure.

Also in 2.0, monitoring improvements report more detailed information about potential low-memory scenarios even before they happen.

FoundationDB 2.0 is backward-compatible with all previous API versions, so any code that you wrote against an old version of FoundationDB will still run. There have been minimal API changes, so updating your code to the new API version will be a snap.

Download FoundationDB 2.0

Upgrade as documented here (just remember that you’ll need to upgrade both clients and servers at the same time).

More information on the Google Group

Follow LuxNoSQL on Twitter
 
Join the LuxNoSQL Community on LinkedIn