Data story

UnQLite4

UnQLite is an Embeddable NoSQL (Key/Value store and Document-store) database engine. Unlike most other NoSQL databases, UnQLite does not have a separate server process. UnQLite reads and writes directly to ordinary disk files. A complete database with multiple collections, is contained in a single disk file. The database file format is cross-platform, you can freely copy a database between 32-bit and 64-bit systems or between big-endian and little-endian architectures.UnQLite features includes:

More information on the official website: http://www.unqlite.org/

Wikipedia Recent Changes Map1

http://rcmap.hatnote.com/

When an unregistered user edits Wikipedia, he or she is identified by his or her IP address. These IP addresses are translated to users’ approximate geographic location. Edits by registered users do not have associated IP information, so the map actually represents only a small portion of the total edit activity on Wikipedia.

Built using d3DataMapsfreegeoip.net, and the Wikimedia RecentChanges IRC feed, broadcast through wikimon. Sourceavailable on github.

Built by Stephen LaPorte and Mahmoud Hashemi.

Google will soon worth more than Apple1

Google now worth more than Microsoft and will soon worth more than Apple

GOOG:US  858.800 USD13.080 1.55%

Google glass is getting hyped and trashed all at the same time and it’s not even here yet. Meanwhile, Android’s marketplace dominance and Google’s nicely executed moves to mobile ads are contributing to the valuation. And of course, Microsoft is suffering as their tablet/smartphone offerings flounder and the PC business that they dominate shrinks.

It’s an interesting thought, thinking through what company could be the next one to reach the quarter-trillion valuation mark, which is the valuation that both Google and Microsoft just recently shot past. In the far future, could (or perhaps not too far into the future, as in later this year?), Oracle and Cisco and Intel reach that plateau? They each have been worth that much before.

dotnetConf – Applied NoSQL in .NET1

Live video from the dotnetConf

Perhaps you’ve heard about the next generation of databases roughly classified as NoSQL databases? These databases are generally much better than RDBMS at scaling, performance, and ease-of-development (e.g. in NoSQL the object-relational impedance mismatch usually disappears). Unfortunately, many talks on NoSQL are very academic and general. Not this one. This session will introduce the ideas around the so-called NoSQL movement, and we’ll learn how to leverage MongoDB (a popular open source NoSQL db) to build .NET applications using LINQ as the data access language. We’ll build out a .NET application using LINQ and MongoDB in a series of interactive demos using Visual Studio 2012 and C#.

 

Redis 2.6.13 has been released1

Redis 2.6.13 has been released, it is a recommended upgrade and especially suggested if you experienced:

1) Strange issues with Lua scripting.

2) Not reconfigured reappearing master using Sentinel.

3) Server continusly trying to save on save error.

(This version of Redis may also help with AOF and slow / busy disks and latency issues.)

* [FIX] Throttle BGSAVE attempt on saving error.
* [FIX] redis-cli: raise error on bad command line switch.
* [FIX] Redis/Jemalloc Gitignore were too aggressive.
* [FIX] Test: fix RDB test checking file permissions.
* [FIX] Sentinel: always redirect on master->slave transition.
* [FIX] Lua updated to version 5.1.5. Fixes rare scripting issues.
* [NEW] AOF: improved latency figures with slow/busy disks.
* [NEW] Sentinel: turn old master into a slave when it comes back.
* [NEW] More explicit panic message on out of memory.
* [NEW] redis-cli: --latency-history mode implemented.

Download: http://redis.io/download

Light Table 0.4 has been released1

Light Table 0.4 has been released and can be downloaded  here
Full Changes list include:
  • FIX: change bundle id for Mac .app
  • FIX: make the fuzzy matching take separators into account
  • FIX: setting the exclude path didn’t take effect until restart
  • FIX: remove errant print statement (#405)
  • FIX: pipe separator highlights (#406)
  • FIX: dramatically improve rendering performance.
  • FIX: correctly parse version parts to numbers for comparison.
  • FIX: set syntax needed a better error message and description (#388)
  • FIX: better searching of the PATH on windows
  • FIX: don’t fail startup if a file/folder in a workspace was deleted
  • FIX: default exclude pattern was too greedy
  • FIX: handle semi-colonless JS much better
  • FIX: remove the tab symbols from the solarized theme
  • FIX: workspace buttons no longer overflow
  • FIX: handle the no available client much more gracefully
  • ADDED: the ability to split the window into multiple tabsets
  • ADDED: you can now have multiple windows open (Cmd/Ctrl-Shift-N to open a window, Cmd/Ctrl-Shift-W to close)
  • ADDED: python eval!
  • ADDED: ipython client integration
  • ADDED: nodejs client
  • ADDED: browser tab Browser: add browser tabBrowser: refresh active browser tab
  • ADDED: browser client using chrome-devtools
  • ADDED: Magical JS VM patching for live updates through the devtools integration
  • ADDED: command grouping
  • ADDED: connect tab that now shows which clients are active
  • ADDED: you can now unset a client from an editor
  • ADDED: connect tab now has add connection that lists all available client types
  • ADDED: executing a command by name with a keybinding will prompt you with the keybinding
  • ADDED: token-based auto-complete (press tab after a character)
  • ADDED: trailing whitespace is now removed on save (use the toggle remove trailing whitespace command to disable)
  • ADDED: line-ending detection on save
  • ADDED: You can now eval any arbitrary selection, just select text and press cmd/ctrl+enter
  • ADDED: Better styling for filter lists
  • ADDED: greatly improved startup time
  • ADDED: new folder, new file, rename, and delete to workspace context menu
  • ADDED: workspaces now watch the file system for changes
  • ADDED: Inline inspectable results for Javascript
  • ADDED: Console inspectable results for Javascript
  • ADDED: A greatly improved console with source information
  • ADDED: You can now put the console in a tab via the Console: Open the console in a tab command
  • ADDED: cancelable eval for Clojure and Python
  • ADDED: editor context menu for cut/copy/paste
  • ADDED: Light Table Docs! Docs: Open Light Table's documentation
  • ADDED: Recent workspaces are remembered, added Workspace: Create new workspace
  • CHANGED: clients tab is now connect
  • CHANGED: moved to acorn for Javascript parsing instead of Esprima
  • CHANGED: completely remove JQuery for significant memory performance increases
  • UPDATED: latest codemirror

More details available here:

http://www.chris-granger.com/2013/04/28/light-table-040/

New Flashbook: DB2 10.5 with BLU Acceleration1

A free ebook will be available for you to download in the coming days on this page

 

 

Just in time for IDUG, Paul Zikopoulos and his team of co-authors have created a new ebook for you to deepen your skills in regards to the latest release.  Here are some details about the flashbook:

Title:

DB2 10.5 with BLU Acceleration - New Dynamic In-Memory Analytics for the Era of Big Data

Authors:

Paul Zikopoulos, Matthew Huras, George Baklarz, Sam Lightstone, Aamer Sachedina

Technical editor: Roman B. Melnyk

Coverage includes:

  • Speed of Thought Analytics with new BLU Acceleration

  • Always Available Transactions with enhanced pureScale reliability

  • Unprecedented Affordability with optimization for SAP workloads

  • Future Proof Versatility with business grade NoSQL and mobile database for greater application flexibility

About the book:

If big data is an untapped natural resource, how can you find the gold dust hidden within?  Leaders realize that big data means all data, and are moving quickly to understand both structured and unstructured application data.  However, analyzing this data without impacting the performance and reliability of essential business applications can prove costly and complex.

In the new era of big data, businesses require data systems that can blend always available transactions with speed of thought analytics.  DB2 10.5 with new BLU Acceleration provides this speed, simplicity and cost efficiency while providing the ability to build next-generation applications with NoSQL features.

With this book, you’ll learn about the power and flexibility of multi-workload, multi-platform database software.  Use the comprehensive knowledge from this book to get started with the latest DB2 release by downloading the trial version.  Visit ibm.com/developerworks/downloads/im/db2/

LevelDB a fast and lightweight key/value database library by Google1

LevelDB a fast and lightweight key/value database library by Google

https://code.google.com/p/leveldb/

LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.

Features

  • Keys and values are arbitrary byte arrays.
  • Data is stored sorted by key.
  • Callers can provide a custom comparison function to override the sort order.
  • The basic operations are Put(key,value)Get(key)Delete(key).
  • Multiple changes can be made in one atomic batch.
  • Users can create a transient snapshot to get a consistent view of data.
  • Forward and backward iteration is supported over the data.
  • Data is automatically compressed using the Snappy compression library.
  • External activity (file system operations etc.) is relayed through a virtual interface so users can customize the operating system interactions.
  • Detailed documentation about how to use the library is included with the source code.

Limitations

  • This is not a SQL database. It does not have a relational data model, it does not support SQL queries, and it has no support for indexes.
  • Only a single process (possibly multi-threaded) can access a particular database at a time.
  • There is no client-server support builtin to the library. An application that needs such support will have to wrap their own server around the library.

Performance

Here is a performance report (with explanations) from the run of the included db_bench program. The results are somewhat noisy, but should be enough to get a ballpark performance estimate.

Setup

We use a database with a million entries. Each entry has a 16 byte key, and a 100 byte value. Values used by the benchmark compress to about half their original size.

   LevelDB:    version 1.1
   Date:       Sun May  1 12:11:26 2011
   CPU:        4 x Intel(R) Core(TM)2 Quad CPU    Q6600  @ 2.40GHz
   CPUCache:   4096 KB
   Keys:       16 bytes each
   Values:     100 bytes each (50 bytes after compression)
   Entries:    1000000
   Raw Size:   110.6 MB (estimated)
   File Size:  62.9 MB (estimated)

 

Write performance

The “fill” benchmarks create a brand new database, in either sequential, or random order. The “fillsync” benchmark flushes data from the operating system to the disk after every operation; the other write operations leave the data sitting in the operating system buffer cache for a while. The “overwrite” benchmark does random writes that update existing keys in the database.

 

   fillseq      :       1.765 micros/op;   62.7 MB/s     
   fillsync     :     268.409 micros/op;    0.4 MB/s (10000 ops)
   fillrandom   :       2.460 micros/op;   45.0 MB/s     
   overwrite    :       2.380 micros/op;   46.5 MB/s

 

Each “op” above corresponds to a write of a single key/value pair. I.e., a random write benchmark goes at approximately 400,000 writes per second.

Each “fillsync” operation costs much less (0.3 millisecond) than a disk seek (typically 10 milliseconds). We suspect that this is because the hard disk itself is buffering the update in its memory and responding before the data has been written to the platter. This may or may not be safe based on whether or not the hard disk has enough power to save its memory in the event of a power failure.

Read performance

We list the performance of reading sequentially in both the forward and reverse direction, and also the performance of a random lookup. Note that the database created by the benchmark is quite small. Therefore the report characterizes the performance of leveldb when the working set fits in memory. The cost of reading a piece of data that is not present in the operating system buffer cache will be dominated by the one or two disk seeks needed to fetch the data from disk. Write performance will be mostly unaffected by whether or not the working set fits in memory.

 

   readrandom   :      16.677 micros/op;  (approximately 60,000 reads per second)
   readseq      :       0.476 micros/op;  232.3 MB/s    
   readreverse  :       0.724 micros/op;  152.9 MB/s

 

LevelDB compacts its underlying storage data in the background to improve read performance. The results listed above were done immediately after a lot of random writes. The results after compactions (which are usually triggered automatically) are better.

 

   readrandom   :      11.602 micros/op;  (approximately 85,000 reads per second)   
   readseq      :       0.423 micros/op;  261.8 MB/s    
   readreverse  :       0.663 micros/op;  166.9 MB/s

 

Some of the high cost of reads comes from repeated decompression of blocks read from disk. If we supply enough cache to the leveldb so it can hold the uncompressed blocks in memory, the read performance improves again:

   readrandom   :       9.775 micros/op;  (approximately 100,000 reads per second before compaction)
   readrandom   :       5.215 micros/op;  (approximately 190,000 reads per second after compaction)

Oracle NoSQL Database 2.0.39 released1

Oracle NoSQL Database 2.0.39 has been released and introduce several improvements, a couple of new Oracle product integration points as well as a number of important bug fixes. These new features and fixes include:

- An integration with Oracle Coherence has been provided that allows Oracle NoSQL Database to be used as a cache for Oracle Coherence applications, also allowing applications to directly access cached data from Oracle NoSQL Database. Documentation can be foundhttp://bit.ly/14e6jEP.

- Oracle NoSQL Database Enterprise Edition now has support for semantic technologies. Specifically, the Resource Description Framework (RDF), SPARQL query language, and a subset of the Web Ontology Language (OWL) are now supported. These capabilities are referred to as the RDF Graph feature of Oracle NoSQL Database. The RDF Graph feature provides a Java-based interface to store and query semantic data in Oracle NoSQL Database Enterprise Edition. Documentation can be found http://bit.ly/Y7aQX4.

Find the complete list of changes in the change log.

Changelog: http://bit.ly/ZweZDS
Download: http://bit.ly/yLGVg3

VoltDB v3.2 has been released1

VoltDB v3.2 has been  released and can be downloaded here: http://voltdb.com/community/downloads.php

Changes include:

  • Enhanced Support for Live Schema Updates
  • Improved Performance and Resilience of Catalog Updates
  • New Return Status for Snapshot Restore
  • hange to the Default Heartbeat Timeout

 

The following issues have been fixed:

  • Automated snapshots and node failure

    It was possible for automated snapshots to silently stop occurring after a node failed and rejoined the cluster. This did not happen all the time, but could not be corrected without restarting the cluster. This issue has been corrected.

  • The sqlcmd command and stored procedure names

    Previously, the sqlcmd command line tool could not invoke a stored procedure if the procedure name started with a SQL statement keyword, such as “select” or “delete”. This issue has been corrected.

  • Enterprise Manager fails to recognize cluster changes

    In recent versions of VoltDB, it was possible for the Enterprise Manager to start a database cluster but not recognize when the database completed startup. Similarly, if a node failed to rejoin or a recover operation did not complete the Enterprise Manager might not recognize these conditions. The symptom in all cases was that the database or server icon would not stop “spinning” in the Enterprise Manager control panel. These issues are now fixed.

Follow LuxNoSQL on Twitter
 
Join the LuxNoSQL Community on LinkedIn