From 063b8c66fd57dbd2209921b9034b2182372bec2d Mon Sep 17 00:00:00 2001 From: Daniel Phillips Date: Sun, 6 Jul 2025 12:01:07 -0600 Subject: [PATCH] Update FAQ - Update FAQ Q&A - Add Zola Shortcodes to section out categories Signed-off-by: Daniel Phillips --- topics/faq.md | 168 +++++++++++++++++++++++++------------------------- 1 file changed, 83 insertions(+), 85 deletions(-) diff --git a/topics/faq.md b/topics/faq.md index a8bad50f7..03c0fa71d 100644 --- a/topics/faq.md +++ b/topics/faq.md @@ -1,24 +1,39 @@ --- title: "FAQ" -description: > - Commonly asked questions when getting started with Valkey +description: "Frequently Asked Questions about Valkey" +template: "faq.html" +aliases: ["/docs/topics/faq/", "/faq/"] --- -## How is Valkey different from other key-value stores? - -* Valkey has a different evolution path in the key-value DBs where values can contain more complex data types, with atomic operations defined on those data types. Valkey data types are closely related to fundamental data structures and are exposed to the programmer as such, without additional abstraction layers. -* Valkey is an in-memory but persistent on disk database, so it represents a different trade off where very high write and read speed is achieved with the limitation of data sets that can't be larger than memory. Another advantage of -in-memory databases is that the memory representation of complex data structures -is much simpler to manipulate compared to the same data structures on disk, so -Valkey can do a lot with little internal complexity. At the same time the -two on-disk storage formats (RDB and AOF) don't need to be suitable for random -access, so they are compact and always generated in an append-only fashion -(Even the AOF log rotation is an append-only operation, since the new version -is generated from the copy of data in memory). However this design also involves -different challenges compared to traditional on-disk stores. Being the main data -representation on memory, Valkey operations must be carefully handled to make sure -there is always an updated version of the data set on disk. - -## What's the Valkey memory footprint? + +{% faq_section(class="general") %} + +## General {#general} + +### What is Valkey? + +Valkey is an open-source, in-memory data structure store that can be used as a database, cache, and message broker. It supports various data structures such as strings, hashes, lists, sets, sorted sets with range queries, bitmaps, hyperloglogs, geospatial indexes with radius queries, and streams. + +### How is Valkey different from other key-value stores? + +Valkey has a different evolution path in the key-value DBs where values can contain more complex data types, with atomic operations defined on those data types. Valkey data types are closely related to fundamental data structures and are exposed to the programmer as such, without additional abstraction layers. + +Valkey is an in-memory but persistent on disk database, so it represents a different trade off where very high write and read speed is achieved with the limitation of data sets that can't be larger than memory. Another advantage of in-memory databases is that the memory representation of complex data structures is much simpler to manipulate compared to the same data structures on disk, so Valkey can do a lot with little internal complexity. + +### Can you use Valkey with a disk-based database? + +Yes, a common design pattern involves taking very write-heavy small data in Valkey (and data you need the Valkey data structures to model your problem in an efficient way), and big *blobs* of data into an SQL or eventually consistent on-disk database. Similarly sometimes Valkey is used in order to take in memory another copy of a subset of the same data stored in the on-disk database. This may look similar to caching, but actually is a more advanced model since normally the Valkey dataset is updated together with the on-disk DB dataset, and not refreshed on cache misses. + +### Why did Linux Foundation start the Valkey project? + +Valkey was created as a fork of Redis after Redis changed its licensing model. The Linux Foundation started the Valkey project to ensure the continued availability of an open-source, community-driven in-memory data store that maintains the original Redis API compatibility while being developed under open-source principles. + +{% end %} + +{% faq_section(class="memory-performance") %} + +## Memory & Performance {#memory-performance} + +### What's the Valkey memory footprint? To give you a few examples: @@ -28,96 +43,53 @@ To give you a few examples: Testing your use case is trivial. Use the `valkey-benchmark` utility to generate random data sets then check the space used with the `INFO memory` command. -## Why does Valkey keep its entire dataset in memory? +### Why does Valkey keep its entire dataset in memory? -In the past, developers experimented with Virtual Memory and other systems in order to allow larger than RAM datasets, but after all we are very happy if we can do one thing well: data served from memory, disk used for storage. So for now there are no plans to create an on disk backend for Valkey. Most of what -Valkey is, after all, a direct result of its current design. +In the past, developers experimented with Virtual Memory and other systems in order to allow larger than RAM datasets, but after all we are very happy if we can do one thing well: data served from memory, disk used for storage. So for now there are no plans to create an on disk backend for Valkey. Most of what Valkey is, after all, a direct result of its current design. -If your real problem is not the total RAM needed, but the fact that you need -to split your data set into multiple Valkey instances, please read the -[partitioning page](cluster-tutorial.md) in this documentation for more info. +If your real problem is not the total RAM needed, but the fact that you need to split your data set into multiple Valkey instances, please read the [partitioning page](cluster-tutorial.md) in this documentation for more info. -## Can you use Valkey with a disk-based database? - -Yes, a common design pattern involves taking very write-heavy small data -in Valkey (and data you need the Valkey data structures to model your problem -in an efficient way), and big *blobs* of data into an SQL or eventually -consistent on-disk database. Similarly sometimes Valkey is used in order to -take in memory another copy of a subset of the same data stored in the on-disk -database. This may look similar to caching, but actually is a more advanced model -since normally the Valkey dataset is updated together with the on-disk DB dataset, -and not refreshed on cache misses. - -## How can I reduce Valkey' overall memory usage? +### How can I reduce Valkey's overall memory usage? A good practice is to consider memory consumption when mapping your logical data model to the physical data model within Valkey. These considerations include using specific data types, key patterns, and normalization. Beyond data modeling, there is more info in the [Memory Optimization page](memory-optimization.md). -## What happens if Valkey runs out of memory? +### What happens if Valkey runs out of memory? -Valkey has built-in protections allowing the users to set a max limit on memory -usage, using the `maxmemory` option in the configuration file to put a limit -to the memory Valkey can use. If this limit is reached, Valkey will start to reply -with an error to write commands (but will continue to accept read-only -commands). +Valkey has built-in protections allowing the users to set a max limit on memory usage, using the `maxmemory` option in the configuration file to put a limit to the memory Valkey can use. If this limit is reached, Valkey will start to reply with an error to write commands (but will continue to accept read-only commands). -You can also configure Valkey to evict keys when the max memory limit -is reached. See the [eviction policy docs](lru-cache.md) for more information on this. +You can also configure Valkey to evict keys when the max memory limit is reached. See the [eviction policy docs](lru-cache.md) for more information on this. -## Background saving fails with a fork() error on Linux? +### How can Valkey use multiple CPUs or cores? -Short answer: `echo 1 > /proc/sys/vm/overcommit_memory` :) +Enable I/O threading to offload client communication to threads. In Valkey 8, the I/O threading implementation has been rewritten and greatly improved. Reading commands from clients and writing replies back uses considerable CPU time. By offloading this work to separate threads, the main thread can focus on executing commands. -And now the long one: +You can also start multiple instances of Valkey in the same box and combine them into a [cluster](cluster-tutorial.md). -The Valkey background saving schema relies on the copy-on-write semantic of the `fork` system call in -modern operating systems: Valkey forks (creates a child process) that is an -exact copy of the parent. The child process dumps the DB on disk and finally -exits. In theory the child should use as much memory as the parent being a -copy, but actually thanks to the copy-on-write semantic implemented by most -modern operating systems the parent and child process will _share_ the common -memory pages. A page will be duplicated only when it changes in the child or in -the parent. Since in theory all the pages may change while the child process is -saving, Linux can't tell in advance how much memory the child will take, so if -the `overcommit_memory` setting is set to zero the fork will fail unless there is -as much free RAM as required to really duplicate all the parent memory pages. -If you have a Valkey dataset of 3 GB and just 2 GB of free -memory it will fail. - -Setting `overcommit_memory` to 1 tells Linux to relax and perform the fork in a -more optimistic allocation fashion, and this is indeed what you want for Valkey. - -You can refer to the [proc(5)][proc5] man page for explanations of the -available values. +{% faq_section(class="technical") %} -[proc5]: https://man7.org/linux/man-pages/man5/proc.5.html +## Technical {#technical} -## Are Valkey on-disk snapshots atomic? +### Background saving fails with a fork() error on Linux? -Yes, the Valkey background saving process is always forked when the server is -outside of the execution of a command, so every command reported to be atomic -in RAM is also atomic from the point of view of the disk snapshot. +Short answer: `echo 1 > /proc/sys/vm/overcommit_memory` :) -## How can Valkey use multiple CPUs or cores? +And now the long one: -Enable I/O threading to offload client communication to threads. -In Valkey 8, the I/O threading implementation has been rewritten and greatly improved. -Reading commands from clients and writing replies back uses considerable CPU time. -By offloading this work to separate threads, the main thread can focus on executing commands. +The Valkey background saving schema relies on the copy-on-write semantic of the `fork` system call in modern operating systems: Valkey forks (creates a child process) that is an exact copy of the parent. The child process dumps the DB on disk and finally exits. In theory the child should use as much memory as the parent being a copy, but actually thanks to the copy-on-write semantic implemented by most modern operating systems the parent and child process will _share_ the common memory pages. A page will be duplicated only when it changes in the child or in the parent. Since in theory all the pages may change while the child process is saving, Linux can't tell in advance how much memory the child will take, so if the `overcommit_memory` setting is set to zero the fork will fail unless there is as much free RAM as required to really duplicate all the parent memory pages. If you have a Valkey dataset of 3 GB and just 2 GB of free memory it will fail. -You can also start multiple instances of Valkey in the same box and combine them into a [cluster](cluster-tutorial.md). +Setting `overcommit_memory` to 1 tells Linux to relax and perform the fork in a more optimistic allocation fashion, and this is indeed what you want for Valkey. -## What is the maximum number of keys a single Valkey instance can hold? What is the maximum number of elements in a Hash, List, Set, and Sorted Set? +You can refer to the [proc(5)][proc5] man page for explanations of the available values. -Valkey can handle up to 232 keys, and was tested in practice to -handle at least 250 million keys per instance. +[proc5]: https://man7.org/linux/man-pages/man5/proc.5.html -Every hash, list, set, and sorted set, can hold 232 elements. +### Are Valkey on-disk snapshots atomic? -In other words your limit is likely the available memory in your system. +Yes, the Valkey background saving process is always forked when the server is outside of the execution of a command, so every command reported to be atomic in RAM is also atomic from the point of view of the disk snapshot. -## Why does my replica have a different number of keys than its primary instance? +### Why does my replica have a different number of keys than its primary instance? If you use keys with limited time to live (Valkey expires) this is normal behavior. This is what happens: @@ -128,6 +100,32 @@ If you use keys with limited time to live (Valkey expires) this is normal behavi Because of this, it's common for users with many expired keys to see fewer keys in the replicas. However, logically, the primary and replica will have the same content. -## Why did Linux Foundation start the Valkey project? +{% faq_section(class="limitations") %} + +## Limitations {#limitations} + +### What is the maximum number of keys a single Valkey instance can hold? What is the maximum number of elements in a Hash, List, Set, and Sorted Set? + +Valkey can handle up to 232 keys, and was tested in practice to handle at least 250 million keys per instance. + +Every hash, list, set, and sorted set, can hold 232 elements. + +In other words your limit is likely the available memory in your system. + +### What are the main limitations of Valkey? + +The main limitations of Valkey include: + +* **Memory-based storage**: All data must fit in RAM, which limits the total dataset size to available memory. +* **Single-threaded command execution**: While I/O threading is available, commands are executed sequentially in the main thread. +* **No built-in authentication**: Authentication must be implemented at the application level or through network security. +* **Limited query capabilities**: Unlike SQL databases, Valkey doesn't support complex queries or joins. +* **No built-in backup**: Backup and recovery must be implemented using external tools or scripts. + +### Can Valkey handle transactions? + +Yes, Valkey supports transactions through the `MULTI`, `EXEC`, `DISCARD`, and `WATCH` commands. However, these are not ACID transactions in the traditional database sense. Valkey transactions provide atomicity (all commands in a transaction are executed or none are) but not isolation (other clients can see intermediate states). + +### What happens when Valkey runs out of disk space? -Read about [the history of Valkey](history.md). +When Valkey runs out of disk space, it will stop accepting write commands and return errors. The server will continue to serve read commands but won't be able to persist data to disk. It's important to monitor disk space and implement proper alerting to prevent this situation.