Navigating Business Growth: How TiDB Scales Petabyte-Level Data Volumes
Business growth is an exciting milestone, bringing more users, transactions, and opportunities to innovate. However, with growth comes significant technical challenges, such as: In this blog, we’ll explore how TiDB, an open-source distributed SQL database, addresses these business growth challenges. We’ll also walk through real-world examples from companies such as Bolt and Flipkart. Bolt: A […]
Effective Online DDL: Making Critical Database Schema Changes with Zero Downtime
Online Data Definition Language (DDL) is a crucial feature for modern databases. It allows schema changes without significant downtime or locking that could disrupt database operations. This means these operations carry out while the database continues to be available for reads and writes, minimizing downtime and avoiding disruption to ongoing activities. Online DDL is particularly […]
Multi-Tenant Architecture: Enhancing Database Scalability with TiDB
In the era of cloud computing and Software as a Service (SaaS), it’s essential to optimize resource use and scalability in databases. Multi-tenant architecture meets these needs by allowing a single database instance to serve multiple customers, or tenants. This ensures each tenant’s data remains isolated and secure, leading to enhanced cost efficiency, simplified management, […]
Web3Bench: A New HTAP Benchmark for Web3 Workloads
This blog introduces Web3Bench, a hybrid transaction/analytical processing (HTAP) benchmark that addresses earlier limitations. Web3Bench is based on real-world Web3 use cases that utilize HTAP. Our data model is a simplified version of the decentralized blockchain Ethereum. We leverage a sample data set from Ethereum to build a scale factor-based data generator. The workload in […]
What is Database Sharding? An Architecture Pattern for Increased Database Performance
Database sharding is a data architecture strategy that increases database performance by splitting up data into chunks and then spreading these chunks “intelligently” across multiple database servers (or database instances). These chunks of data are called shards, while each shard contains a subset of our data. All shards represent the entire set of data, and […]
Turning Data into Insights in 10 Minutes with Custom GPTs and APIs
We just built Open Source Benchmark, an innovative GPT designed to streamline open-source software (OSS) comparisons. Powered by our own data APIs and third-party APIs, this GPT analyzes two repositories and offers instant insights for tech decision-making. Open the GPT now and click PyTorch vs Tensor to see how it works in action. How […]
Maintaining Database Connectivity in Serverless Infrastructure with TiProxy
Businesses often struggle to ensure smooth database connections during scaling and upgrades. Frustrations can surface from underused instances after scaling out, disruptions when scaling in, and potential disconnections during upgrades. While connection pools and long-lived connections offer stability, they can cause issues in fluid environments where databases need to adapt quickly. This leads to increased […]
Why Distributed SQL Databases Elevate Modern Application Development
With the explosive growth of data and the need for scalable and efficient systems, traditional relational and NoSQL databases have faced limitations. This has led to the emergence of distributed SQL databases, revolutionizing how organizations handle their data. Choosing the right database to power modern applications can be challenging. For starters, as data volumes grow […]
Raft Engine: A Log-Structured Embedded Storage Engine for Multi-Raft Logs in TiKV
We introduce how we build and implement TiKV’s Raft Engine, a log-structured embedded storage engine for multi-Raft logs, and how it improves storage performance.
How TiDB Processes DML in Data Migration
Learn how Sync, Data Migration's core component, processes DML statements through the binlog replication workflow.
How Good is TiDB as an HTAP System? A HATtrick Benchmark
Inspired by the SIGMOD/PODS 22 paper “How Good is my #HTAP System” and HATtrick, the proposed benchmark tool, we tested the TP/AP performance isolation and horizontal scalability of TiDB as an HTAP database.
How I Found a Go Issue on ARM that Crashed the Database Server
A distributed database like TiDB is a very complex project. Its performance depends on multiple factors. And I will share a peculiar Go bug that caused TiDB to crash on ARM.