Selecting an EC2 Instance for KeyDB
When searching for information on what instance type to use, the answer is typically “it depends”. This article boils down instance types with both general data on EC2 types as well as in depth analysis of KeyDB performance and selection for the different types.
The focus of this article will be using KeyDB as an in memory database. There will be future articles on using KeyDB with FLASH storage, as well as with some of KeyDB-Pro’s features such as FLASH persistence and advanced querying where many more cores can be taken advantage of under query intensive workloads (O(n) heavy operations such as KEYS). When AWS Graviton2's are available we will publish those results also.
Lets start taking a simple look at the breakdown of using EC2 instances to give you a general idea before digging heavy into analysis
Cost per GB of each Instance Type
As an in-memory database the first question is what types will optimize cost seeing as memory (RAM) is often the main caveat. At an initial glance you see that x1 instances are priced the best along with r5 and t instances. However going with the cheapest type may not be the best choice or the right choice for you. The other factors going into the decision are how much storage you are looking for as well as the performance you are looking for. Please note all costs recorded in this article are for on demand instances.
Performance and Vertical Scalability
Performance is usually a big factor to prevent latencies when heavily loaded. KeyDB is a fast multithreaded database that can scale vertically before scaling horizontally. The chart below selects several instance types displaying the first (lowest cost) instance that is saturated at peak performance with Memtier benchmark. It can be seen that C5, R5, and M5 instances are able to get the best performance out of the set. These instances have faster CPU’s and are able to get 1 million ops/sec each.
As you can see there are pros and cons of each instance type depending on what you are looking for. To get a better understanding of each we will take a look at some more data in order to make generalizations on each type described later in this report.
The charts below show a lot of these instances. It compares the available memory (GB) of each instance, the benchmarked ops/sec, the price/GB ram, and the cost normalized for 1000ops/sec as a cost per perf comparison. These charts can make it much easier to select and refine the best instance for your use case.
Please note that t2 & t3 instances are shaded differently because they are burstable instances and the numbers represent peak load, not sustained load. If CPU credits are used up the instances may be throttled down drastically, hence understanding your usage when selecting them is critical
Performance, Memory or both?
Because KeyDB is multithreaded and extremely fast most of our users tend not to run more than one instance (except for high availability). This reduces the complexity of the setup and offers great benefits for those not wanting to cluster. To get an idea of how KeyDB optimizes resources, KeyDB’s multithreading will get the equivalent performance of a 7 node Redis cluster on the same machine at which point both saturate the performance of the machine.
The first trend you may notice is that performance becomes saturated above .4xlarge machines. So if performance becomes a bottleneck for your application you will shard at or above this machine size depending on how much memory you require for your database. This ensures you scale the instance vertically first. Prior to clustering you can also create a read-replica or active-replica instance to handle some traffic and provide high availability.
Instance Pro’s and Con’s
For KeyDB there are different benefits and downfalls of each instance type depending on your setup/use case. We will outline benefits as they relate to KeyDB. For more on types you can read up here. Hopefully this summary helps refine the decision:
M5 instances are great for most scenarios. They have a 3.1 GHz Intel Xeon® Platinum 8175 processor that enables KeyDB to get over 1 million ops/sec. Memory is available up to 384GB. If you are looking for performance and don’t have a memory requirement as high as the r5 instances, this is a good route to go.
R5 instances offer the best combination of performance and memory available. Memory with r5 instances is one of the cheapest per GB and it has the same processor as the m5 enabling over 1 million ops/sec. For large datasets requiring high performance the r5 instances are one of the best options.
C5 compute optimized instances use a 3.5GHz Intel Xeon Scalable processor which is very fast, however the performance is similar to the m5/r5 processors getting just over 1 million ops/sec. For performance the c5 instances work with reasonably cost effective ops/sec for what you get. However if you are limited by memory in any way, the c5 instances are the priciest when it comes to $/GB. Most use cases will not benefit with c5 instances over m5/r5.
T2 and T3 instances are great potential options for small fluctuating use cases. The t3 instances do get better performance than the t2 instances in our testing as the processors were slightly faster. The main thing to note about t2 & t3 instances is that they are burstable. This means they can perform great for short bursts but don’t expect that performance continually. You have CPU credits that are used, and if you go over, your CPU is throttled down which can create a negative high latency experience for your users. Some instances may be throttled down to as low as 5% of capacity so its important to understand your usage to see if it’s the right fit. For burstable stats and description check out this doc.
A1 arm instances are one of amazons more affordable options using the AWS Graviton Processors. These processors are better for smaller databases that perform under sustained loads unlike the burstable t3/t2 instances. A1 instances provide the cheapest price when it comes to ops/sec (volume of operations processed per dollar spent), however the cost per GB memory is relatively high and the 2.3GHz processor does not get as high ops/sec as other instance types. For cases already using arm applications and workloads, KeyDB on Arm is a great fit and works well with it. We expect very large performance gains from the new Graviton2's and will publish the information on them as its obtained.
X1 instances have the lowest cost per GB memory, however the 2.3GHz Intel Xeon E7 8880 v3 processor caps out at 620,000ops/sec so doesn’t achieve quite as high as the m5/r5 instances. However for cases requiring a lot of in memory storage that are not nearing performance limits of the instance, the X1 family might be the right option for you.
Storage optimized instances such as the i3 family are great especially for using features such as KeyDB on FLASH. We didn’t cover this instance type in much detail as the FLASH argument vs instance type will be another article in itself. The i3 has a 2.3GHz Intel Xeon E5-2686 v4 processor and maxes out around 400,000 ops/sec. Costs are not comparable to other instances unless you are taking advantage of the SSD storage. The i3en instances use Intel® Xeon® Scalable (Skylake) or Xeon(R) Platinum 8175M processor, and can get as much as 845,000ops/sec. Hence paying attention to the instance specifiers is important when selecting.
When selecting EC2 instances there are a few other factors to consider. I do not intend on going into detail on these, however it helps to be aware of your options:
- You may notice suffixes on the instances such as d,n,a or e. (ie. m5d.4xlarge)
- ‘n’ means network optimized (higher bandwidth). Processor may be different also so take a look.
- ‘d’ means it has additional NVMe-based SSD storage. This is physically connected to the host server and provides block-level storage that is coupled to the lifetime of the instance
- ‘a’ means it has an AMD processor
- You can compare processor types and stats here.
- Depending on your use case there is on demand pricing (used in this article) which is per hour billing. However if you plan on running the server continually you can look at reserved instance pricing which can be up to 75% cheaper. Spot instances can be requested for those with flexibility. See here for more info.
- Keep in mind the processor used as it can result in different performance. It is always recommended to run some sort of benchmarking when selecting instances. One reason other than selecting the right type, is to have a baseline to compare with to help identify bottlenecks in your setup.
- EBS optimized instances can ensure dedicated throughput between ec2 and EBS volumes which can have benefits in some cases for backing up and saving. S3 options are also a cheap option available that is built into KeyDB.
There is a lot to chew on when trying to refine your selection. Finding the right compromise between cost, memory, performance, storage largely depends on the loading/traffic on your database. However once you have an idea, it becomes a bit easier to choose. Hopefully this article provides some insight on making your selections without being an expert on EC2 instances. The AWS documentation is really good for improving your understanding.
KeyDB on FLASH Comparison
As evident in this article there are equally as many questions when setting up a low cost, high storage instance using FLASH options heavily. KeyDB is releasing a new version of FLASH shortly and will be be doing an evaluation on AWS instances.
The Memtier benchmarking instance was a m5.8xlarge instance. An instance of this size is required to max out traffic to KeyDB. The following command was used for testing in most cases.
memtier_benchmark -s <ipaddressofserver> --threads=32
KeyDB test instance typically run with the following command. Note that # threads will be truncated based on the available resources.
keydb-server --server-threads 7 --server-thread-affinity true
If you need authentication pass argument
--authenticate=<yourpassword> to memtier and
--requirepass <yourpassword> to KeyDB. If on secure network you can use
--protected-mode no for KeyDB.
If your memtier instance is not large enough it will be the bottleneck and will skew results. If you are going through a VPC, VPN, or load balancer its possible you may have a bottleneck there. Otherwise the numbers should be very similar to those posted.