-
Notifications
You must be signed in to change notification settings - Fork 43
Add a detailed review of the key features in release 3.4 #5082
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: latest
Are you sure you want to change the base?
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,249 @@ | ||
Tarantool 3.4 | ||
============= | ||
|
||
Release date: April 14, 2024 | ||
|
||
Releases on GitHub: :tarantool-release:`3.4.0` | ||
|
||
The 3.4 release of Tarantool adds the following main product features and improvements | ||
for the Community and Enterprise editions: | ||
|
||
* **Community Edition (CE)** | ||
|
||
* Memtx <-> vinyl cross-engine transactions. | ||
* New ``index:quantile()`` function for finding a quantile key in an indexed data range. | ||
* Functional indexes in the MVCC transaction manager. | ||
* Vinyl now supports ``np`` (next prefix) and ``pp`` (previous prefix) iterators. | ||
* Fixed incorrect number comparisons and duplicates in unique indexes. | ||
* Runtime priviledges for ``lua_call`` are now granted before ``box.cfg()``. | ||
* The ``stop`` callbacks for the roles are now called during graceful shutdown, | ||
in the reverse order of roles startup. | ||
* New ``has_role``, ``is_router``, and ``is_storage`` methods in the | ||
``config`` module to check if a role is enabled on an instance. | ||
* LuaJIT profilers are now more user-friendly. | ||
* Built-in logger now encodes table arguments in the JSON format. | ||
* Multiple bugfixes for MVCC, vinyl, WAL, and snapshotting. | ||
* Fixed memory overgrowing for cdata-intensive workloads. | ||
|
||
* **Enterprise Edition (EE)** | ||
|
||
* New in-memory columnar storage engine: ``memcs``. | ||
* New bootstrap strategy in failover: ``native``. | ||
* New public API for accessing remote ``config.storage`` clusters as key-value storages. | ||
* Two-phase appointment process to avoid incorrect behavior of the failover coordinator. | ||
|
||
.. _3-4-memcs: | ||
|
||
[EE] New in-memory columnar storage engine: ``memcs`` | ||
----------------------------------------------------- | ||
|
||
The engine stores data in the memtx arena but in contrast to memtx it doesn't | ||
organize data in tuples. Instead, it stores data in columns. Each format field | ||
is assigned its own BPS tree-like structure (BPS vector), which stores values | ||
only of that field. If the field type fits in 8 bytes, raw field values are | ||
stored directly in tree leaves without any encoding. For values larger than 8 | ||
bytes, like decimal, uuid or strings, the leaves store pointers to | ||
MsgPack-encoded data. | ||
|
||
The main benefit of such data organization is a significant performance boost | ||
of columnar data sequential scans compared to memtx thanks to CPU cache | ||
locality. That's why memcs supports a special C api for such columnar scans: | ||
see `box_index_arrow_stream()` and `box_raw_read_view_arrow_stream()`. | ||
Peak performance is achieved when scanning embedded field types. | ||
|
||
Querying full tuples, like in memtx, is also supported, but the performance is | ||
worse compared to memtx, because a tuple has to be constructed on the runtime | ||
arena from individual field values gathered from each column tree. | ||
|
||
Other features include: | ||
* Point lookup. | ||
* Stable iterators. | ||
* Insert/replace/delete/update. | ||
* Batch insertion in the Arrow format. | ||
* Transactions, including cross-engine transactions with memtx | ||
(with ``memtx_use_mvcc_engine = false``). | ||
* Read view support. | ||
* Secondary indexes with an ability to specify covered columns and sequentially scan | ||
indexed + covered columns. | ||
|
||
Embedded field types include only fixed-width types: | ||
* Integer: (u)int8/16/32/64. | ||
* Floating point: float32/64. | ||
|
||
Types with external storage include: | ||
* Strings. | ||
* All the other types supported by Tarantool: UUID, Decimal, Datetime, etc. | ||
|
||
By default, NULL values are stored explicitly and use up the same space as | ||
any other valid column value (1, 2, 4 or 8 bytes depending on an exact field | ||
type), however RLE encoding of NULLs is also supported. For reference, | ||
RLE-encoding of a column with 90% evenly distributed NULL values reduces | ||
memory consumption of that column by around 5 times. | ||
|
||
.. _3-4-cross-engine: | ||
|
||
[CE] Memtx <-> vinyl cross-engine transactions | ||
---------------------------------------------- | ||
|
||
Tarantool now supports mixing statements for memtx and vinyl in the same transaction, | ||
for example: | ||
|
||
.. code-block:: lua | ||
|
||
local memtx = box.schema.space.create('memtx', {engine = 'memtx'}) | ||
memtx:create_index('primary') | ||
local vinyl = box.schema.space.create('vinyl', {engine = 'vinyl'}) | ||
vinyl:create_index('primary') | ||
|
||
memtx:insert({1, 'a'}) | ||
vinyl:insert({2, 'b'}) | ||
|
||
box.begin() | ||
memtx:replace(vinyl:get(2)) | ||
vinyl:replace(memtx:get(1)) | ||
box.commit() | ||
|
||
.. note:: | ||
|
||
* Accessing a vinyl space may trigger a fiber yield (to read a file from the disk), | ||
so MVCC must be enabled in memtx to make use of the new feature: | ||
|
||
.. code-block:: lua | ||
|
||
box.cfg{memtx_use_mvcc_engine = true} | ||
|
||
* Vinyl operations may yield implicitly, so a transaction may be aborted | ||
with TRANSACTION_CONFLICT in case of concurrent transactions. | ||
|
||
.. _3-4-native: | ||
|
||
[EE] New boostrap strategy in failover: ``native`` | ||
-------------------------------------------------- | ||
|
||
Now supervised failover coordinator supports three bootstrap strategies: | ||
native, supervised, auto. | ||
|
||
The new ``native `` strategy acts more or less similar to the ``auto`` strategy, | ||
but relaxes its limitations. It is based on the supervised strategy and basically | ||
performs two things: | ||
* issues ``box.ctl.make_bootstrap_leader({graceful = true})`` to bootstrap | ||
a replicaset; | ||
* issues ``box.ctl.make_bootstrap_leader()`` to keep the bootstrap leader | ||
record pointing to the instance that is currently in the RW mode (to register | ||
new replicas). | ||
Comment on lines
+129
to
+133
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I'm not sure that the calls under the hood are really interesting for our reader. Maybe some motivational things would work better here. Quoted from my mini-announce in a team chat (in Russian):
|
||
|
||
To enable the ``native `` bootstrap strategy, set it in the ``replication`` section | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Nit: Same here regarding the whitespace. |
||
of the cluster's configuration, together with a proper failover strategy | ||
(for ``native``, you can choose any failover strategy you like, for example ``supervised``): | ||
|
||
.. code-block:: yaml | ||
|
||
replication: | ||
failover: supervised | ||
bootstrap_strategy: native | ||
|
||
.. _3-4-runtime-priv: | ||
|
||
[CE] Runtime priviledges for ``lua_call`` granted before ``box.cfg()`` | ||
---------------------------------------------------------------------- | ||
|
||
It is now possible to grant execution privileges for Lua functions | ||
through the declarative configuration, even when the database is in | ||
read-only mode or has an outdated schema version. You might also | ||
permit ``guest`` to execute Lua functions before the initial bootstrap. | ||
|
||
You can specify function permissions using the ``lua_call`` option in | ||
the configuration, for example: | ||
|
||
.. code-block:: lua | ||
|
||
credentials: | ||
users: | ||
alice: | ||
privileges: | ||
- permissions: [execute] | ||
lua_call: [my_func] | ||
|
||
This grants the ``alice`` user permission to execute the ``my_func`` Lua | ||
function, regardless of the database's mode or status. The special option | ||
``lua_call: [all]`` is also supported, granting access to all global Lua | ||
functions except built-in ones, bypassing database restrictions. | ||
|
||
Privileges will still be written to the database when possible to | ||
maintain compatibility and consistency with other privilege types. | ||
|
||
[CE] New methods in the ``config`` module to check instance roles | ||
----------------------------------------------------------------- | ||
|
||
Three new methods are now available in the ``config`` module: | ||
|
||
* ``has_role(<role_name>, {instance = <instance_name})`` returns ``true`` if | ||
the instance with the name ``<instance_name>`` has the role ``<role_name>`` | ||
enabled in the current configuration, or ``false`` if not. | ||
The second argument is optional: if not provided, the check is performed | ||
for the instance the method is called on. | ||
Comment on lines
+180
to
+184
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Nit: It feels a bit too formal for the announcement. Would look appropriate in an API documentation, but here I would make it simpler:
Same for |
||
|
||
* ``is_router({instance = <instance_name})`` returns ``true`` if the instance | ||
with the name ``<instance_name>`` is a vshard router, according to the current | ||
configuration, or ``false`` if not. | ||
The argument is optional: if not provided, the check is performed for | ||
the instance the method is called on. | ||
|
||
* ``is_storage({instance = <instance_name})`` returns ``true`` if the instance | ||
with the name ``<instance_name>`` is a vshard storage, according to the current | ||
configuration, or ``false`` if not. | ||
The argument is optional: if not provided, the check is performed for | ||
the instance the method is called on. | ||
|
||
.. _3-4-storage-client-api: | ||
|
||
[EE] New public API: ``config.storage_client`` | ||
---------------------------------------------- | ||
|
||
Remote ``config.storage`` clusters can now be accessed by using the | ||
``config.storage_client.connect(endpoints[, {options}])`` method. | ||
The returned object represents a connection to a remote key-value | ||
storage accessed through the ``:get()``, ``:put()``, ``:info()``, ``:txn()`` | ||
methods with the same signature as in the server | ||
:ref:`config.storage <config_module_api_reference>` API. | ||
|
||
The ``config.storage_client`` API has also several specific methods: | ||
``:is_connected()``, ``:watch()``, ``:reconnect()``, ``:close()``. | ||
|
||
Here are some usage examples: | ||
|
||
.. code-block:: lua | ||
|
||
-- Connect to a config.storage cluster using the endpoints | ||
-- configured in the `config.storage` section. | ||
-- | ||
-- You can provide endpoints as a Lua table: | ||
-- | ||
-- local endpoints = { | ||
-- { | ||
-- uri = '127.0.0.1:4401', | ||
-- login = 'sampleuser', | ||
-- password = '123456', | ||
-- } | ||
-- } | ||
|
||
local endpoints = config:get('config.storage.endpoints') | ||
local client = config.storage_client.connect(endpoints) | ||
|
||
-- Put a value to the connected client. | ||
client:put('/v', 'a') | ||
|
||
-- Get all stored values. | ||
local values = client:get('/') | ||
|
||
-- Clean the storage. | ||
local response = client:delete('/') | ||
|
||
-- Watch for key changes. | ||
local log = require('log') | ||
local w = client:watch('/config/main', function() | ||
log.info('config has been updated') | ||
end) | ||
|
||
-- Unregister a watcher. | ||
w:unregister() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nit: GitHub highlight this part is a bit strage way due to a whitespace before double backtick (after
native
). Not sure how it is rendered by Sphinx, but maybe it worth to drop the whitespace to be on the safe side.