Loro Blog

Mergeable Containers: Fixing Concurrent Child Creation

Tue, 09 Jun 2026 00:00:00 GMT

Mergeable Containers: Fixing Concurrent Child Creation

Two users are offline. Both add content to the same empty note. They come back online, sync finishes, and one user's edits seem to disappear.

There is no error, and the data is not actually gone from history. But note.get("body") can only return one Text container. The other container was created concurrently and still exists in history, but it is no longer visible in the current document state. From the application's point of view, this looks like data loss.

This is a classic problem in JSON-like CRDTs. Users have run into versions of it in the Loro, Yjs, and Automerge communities. The Appendix has short scripts that reproduce it in all three.

Loro now solves this with Mergeable Containers. They make a child container's identity come from its logical position in the Map, not from the ID of the operation that happened to create it.

Special thanks to Alexis Williams from Synapdeck for the substantial implementation work and design discussion behind this feature.

From the user's point of view, the API change is small. Instead of creating an on-demand child container like this:

// Peer A
doc.getMap("days").setContainer("2026-06-08", new LoroList()).insert(0, "A");

// Peer B, offline
doc.getMap("days").setContainer("2026-06-08", new LoroList()).insert(0, "B");

// after sync: only one List is visible at "2026-06-08"

you can use a mergeable child:

// Peer A
doc.getMap("days").ensureMergeableList("2026-06-08").insert(0, "A");

// Peer B, offline
doc.getMap("days").ensureMergeableList("2026-06-08").insert(0, "B");

// after sync: both peers edit the same List

As a rule of thumb, use ensureMergeable* when a child container should be identified by its logical position:

map.ensureMergeableText(key);
map.ensureMergeableMap(key);
map.ensureMergeableList(key);
map.ensureMergeableMovableList(key);
map.ensureMergeableTree(key);
map.ensureMergeableCounter(key);

Use them for fields that should behave like one shared child container for everyone: one shared Text, one shared List, one shared Map, and so on. It should not matter which peer creates that child first. The rest of this post walks through why the problem exists and how the new encoding works.

Why This Happens

CRDTs are usually good at cases like "multiple users editing the same text at the same time" or "multiple users inserting into the same list concurrently." This issue happens one layer earlier: before the peers can edit the same List, Text, or Map, they first need to agree on which child container that key refers to.

Before Mergeable Containers, the recommended workaround was to initialize all required child containers as soon as the parent LoroMap was created. For example, if every note always needs a body text, creating that body together with the note avoids the first-creation race.

That workaround is useful, but it has limits. Some applications cannot know every child container ahead of time. A schema migration may add a new child container to existing documents. A calendar-like document may create child containers by date. A dynamic index may create one child container per user-defined key. In these cases, on-demand creation is natural, and concurrent first creation is hard to avoid.

The root cause is the way regular child Container IDs are represented. A normal child Container ID includes the OpID that created it. Concurrent first creation therefore creates different Container IDs, and the Map conflict-resolution rule decides which one is visible.

The issue is not that List insertion cannot merge. Once both peers are editing the same List, List edits merge normally. The issue is that the two peers created two different Lists at the same Map key.

Why Root Containers Are Naturally Mergeable

In Loro and Yjs, top-level Root Containers are usually accessed by name:

doc.getMap("state");
doc.getText("content");

Here, "state" or "content" is already a stable identity. It does not depend on which peer created it or which operation created it. As long as multiple peers access the same root name, they naturally refer to the same logical Container.

Automerge has a different object identity model, so this root-container comparison is specifically about Loro and Yjs. The broader issue is still similar: when composite values are created concurrently at the same key, the system needs a rule for which object identity becomes visible.

Regular child Containers are different. Their identity is tied to the operation that created them, so two concurrent "first creations" become two different objects.

Mergeable Containers bring the useful part of Root Container identity to selected child Containers: the child identity comes from a deterministic name, not from the creation operation.

API: Explicitly Ensuring a Mergeable Child

This feature does not change the existing setContainer / insertContainer behavior. It adds explicit ensureMergeable* APIs for the mergeable case. In Rust, the same methods use snake case:

map.ensure_mergeable_text("body")?;
map.ensure_mergeable_map("profile")?;

The word ensure is intentional. It returns the child and, if needed, writes the marker that makes it visible at that key. Calling the same method again for the same type is idempotent.

If the key already holds a regular scalar value or a regular child Container, the API returns an error instead of silently overwriting it.

One subtle case is type changes. If one peer asks for a mergeable Text at "field" while another peer asks for a mergeable Map at the same key, Loro still needs one visible value at that key. The Map's normal conflict rule decides which type is visible. The non-visible mergeable child's state is still preserved under its deterministic ID, so switching back to that type can resurface it later.

Core Design: Deterministic CID + Map Slot Marker

Mergeable Containers have two separate layers of representation:

The child Container ID derived from the parent Container ID, key, and type. This decides whether peers address the same CRDT object.
The parent Map slot. This decides whether that object is currently visible at a key, and which mergeable child type is active there.

Keeping these two layers separate makes the behavior easier to reason about.

1. CID: A Synthetic Root Container ID

A Mergeable Container uses a synthetic ContainerID::Root under an internal namespace. User-created root names cannot use this prefix, so ordinary roots cannot collide with mergeable CIDs:

🤝:

The payload is derived from the parent Map and the key. The Container type stays in ContainerID::Root.container_type, just like ordinary Root Containers. This lets all peers derive the same child ID without using the creation OpID.

The current encoding keeps nested mergeable Map IDs linear in the logical path length. This change was made before release to avoid recursive CID growth for deeply nested mergeable maps.

More details: the flattened CID encoding

Loro Protocol

Thu, 30 Oct 2025 00:00:00 GMT

Loro Protocol

The open-source Loro Protocol project includes the loro-websocket package, the adaptor suite in loro-adaptors, and matching Rust client and server implementations that all interoperate on the same wire format.

The Loro Protocol is a wire protocol designed for real-time CRDT synchronization. Learn about the design in detail here.

It efficiently runs multiple, independent "rooms" over a single WebSocket connection.

This allows you to synchronize your application state, such as a Loro document, ephemeral cursor positions, and end-to-end encrypted documents, over one connection. It is also compatible with Yjs.

Quick Start: Server & Client Example

The protocol is implemented by the loro-websocket client and a minimal SimpleServer for testing. These components are bridged to your CRDT state using loro-adaptors.

Server

For development, you can run the SimpleServer (from loro-websocket) in a Node.js environment.

// server.ts

const server = new SimpleServer({
  port: 8787,
  // SimpleServer accepts hooks for authentication and data persistence:
  // authenticate: async (roomId, crdt, auth) => { ... },
  // onLoadDocument: async (roomId, crdt) => { ... },
  // onSaveDocument: async (roomId, crdt, data) => { ... },
});

server.start().then(() => {
  console.log("SimpleServer listening on ws://localhost:8787");
});

Client

On the client side, you connect once and then join multiple rooms using different adaptors.

// client.ts

// 1. Create and connect the client
const client = new LoroWebsocketClient({ url: "ws://localhost:8787" });
await client.waitConnected();
console.log("Client connected!");

// --- Room 1: A Loro Document (%LOR) ---
const docAdaptor = new LoroAdaptor();
const docRoom = await client.join({
  roomId: "doc:123",
  crdtAdaptor: docAdaptor,
});

// Local edits are now automatically synced
const text = docAdaptor.getDoc().getText("content");
text.insert(0, "Hello, Loro!");
docAdaptor.getDoc().commit();

// --- Room 2: Ephemeral Presence (%EPH) on the SAME socket ---
const ephAdaptor = new LoroEphemeralAdaptor();
const presenceRoom = await client.join({
  roomId: "doc:123", // Can be the same room ID, but different magic bytes
  crdtAdaptor: ephAdaptor,
});

// Ephemeral state syncs, but is not persisted by the server
ephAdaptor.getStore().set("cursor", { x: 100, y: 100 });

Features

Multiplexing

Each binary message is prefixed with four magic bytes that identify the data type, followed by the roomId. This structure allows the server to route messages to the correct handler. A single client can join:

%LOR (Loro Document)
%EPH (Loro Ephemeral Store, for cursors and presence)
%ELO (End-to-End Encrypted Loro Document)
%YJS and %YAW (for Yjs Document and Awareness interoperability)

All traffic runs on the same socket.

Compatibility

The Loro Protocol is designed to accommodate environments like Cloudflare:

Fragmentation: Large updates are automatically split into fragments under 256 KiB and reassembled by the receiver. This addresses platforms that enforce WebSocket message size limits.
Application-level keepalive: The protocol defines simple "ping" and "pong" text frames. These bypass the binary envelope and allow the client to check connection liveness, which is useful in browser or serverless environments where transport-level TCP keepalives are not exposed.

This repository also ships Rust clients and servers that mirror the TypeScript packages.

Experimental E2E Encryption

End-to-end encrypted Loro is included in loro-protocol, but the feature is currently experimental: expect wire formats and key-management APIs to change, and do not rely on it for production-grade security audits yet. When paired with EloLoroAdaptor on the client, the server relays encrypted records without decrypting them.

Status and Licensing

The Loro Protocol is mostly stable. We welcome community feedback and contributions, especially regarding use cases that are difficult to satisfy with the current design.

All the packages in inside https://github.com/loro-dev/protocol are open-sourced under the permissive MIT license.

Loro Mirror: Make UI State Collaborative by Mirroring to CRDTs

Mon, 22 Sep 2025 00:00:00 GMT

Loro Mirror: Make UI State Collaborative by Mirroring to CRDTs

TL;DR. Loro Mirror keeps a typed, immutable app‑state view in sync with a Loro CRDT document. Local setState edits become granular CRDT operations; incoming CRDT events update your state. You keep familiar React patterns and gain collaboration, offline edits, and history.

CRDT: A Conflict‑free Replicated Data Type lets multiple peers edit concurrently and still converge without central coordination.

Local‑first: Data is usable offline and synced later; the device is the primary source of truth.

Overview

Loro is a CRDT library for local‑first apps. It supports rich containers—Text, Map, List/MovableList, MovableTree—with versioning, time‑travel, and compact updates/snapshots.

Though CRDTs ensure CRDTs states converge, apps still need glue code to map between CRDT documents and UI state to ensure their consistency. It's not an easy task.

Loro Mirror addresses this boundary. You declare a schema once. Mirror maintains an immutable app‑state view and handles both directions:

Event → state. Loro events update your state.
State → CRDT. setState diffs become container‑level CRDT ops (insert / delete / move / text edits).

For an update, if k items change and each changed item affects m of its immediate fields, time complexity is ≈ O(k·m). (k = number of changed items; m = average number of changed immediate fields per changed item.) This is similar to React’s render complexity.

Why this exists

Without Mirror, projects that uses Loro need to:

Map CRDTs states to UI states
Diff UI edits and translate them to CRDT operations
Subscribe to CRDT events and patch UI state

This code is repetitive and easy to get wrong. Mirror centralizes it behind a declarative schema.

What Mirror provides

Declarative schema. Describe UI state in terms of Loro containers; Mirror maintains an immutable view.
Typed and framework‑agnostic. Works in plain TypeScript, React (via loro-mirror-react) or any other UI framework that supports immutable states.
Fine‑grained diffs. Generates ops such as item moves in MovableList and character deltas in Text.

How to use

Define a schema that describes your app state
Create a LoroDoc and a Mirror store; provide schema
Update via setState. Subscribe for changes if needed.
Sync across peers using Loro updates; Mirror applies remote delta back to your app state automatically.

Basic Example

/**
 * As an example, you can use `useState` from React to manage the state
 *
 * `const [appState, setAppState] = useState({});`
 */
function setAppState(state: any) {}
// ---cut---

// 1) Declare state shape – a MovableList of todos with stable Container ID `$cid`
type TodoStatus = "todo" | "inProgress" | "done";
const appSchema = schema({
  todos: schema.LoroMovableList(
    schema.LoroMap({
      text: schema.String(),
      status: schema.String(),
    }),
    // $cid is the container ID of LoroMap assigned by Loro
    (t) => t.$cid,
  ),
});

// 2) Create a Loro document and a Mirror store
const doc = new LoroDoc();
const store = new Mirror({
  doc,
  schema: appSchema,
  // InitialState will not be written into LoroDoc
  initialState: { todos: [] },
});

// 3) Subscribe (optional) – know whether updates came from local or remote
const unsubscribe = store.subscribe((state, { direction, tags }) => {
  if (direction === SyncDirection.FROM_LORO) {
    console.log("Remote update", { state, tags });
  } else {
    console.log("Local update", { state, tags });
  }

  // You can use `state` to render directly, it's a new immutable object that shares
  // the unchanged fields with the old state
  setAppState(state);
});

// 4) Either draft‑mutate or return a new state
// Draft‑style (mutate a draft)
store.setState((s) => {
  s.todos.push({ text: "Draft add", status: "todo" });
});

// Immutable return (construct a new object)
store.setState((s) => ({
  ...s,
  todos: [...s.todos, { text: "Immutable add", status: "todo" }],
}));

// 5) Sync across peers with Loro updates (transport‑agnostic)
// Example: two docs in memory – in real apps, send `bytes` over WS/HTTP/WebRTC
const other = new LoroDoc();
other.import(doc.export({ mode: "snapshot" }));

// Wire realtime sync (local updates → remote import)
const stop = doc.subscribeLocalUpdates((bytes) => {
  other.import(bytes);
});

// Any `store.setState(...)` on `doc` now appears in `other` as well

React Example


type TodoStatus = "todo" | "inProgress" | "done";

const todoSchema = schema({
  todos: schema.LoroMovableList(
    schema.LoroMap({
      text: schema.String(),
      status: schema.String(),
    }),
    (t) => t.$cid,
  ),
});

export function TodoApp() {
  const doc = useMemo(() => new LoroDoc(), []);
  const { state, setState } = useLoroStore({
    doc,
    schema: todoSchema,
    initialState: { todos: [] },
  });

  function addTodo(text: string) {
    setState((s) => {
      s.todos.push({ text, status: "todo" });
    });
  }

  return (
    <>
      
      
        {state.todos.map((t) => (
          
            
                setState((s) => {
                  const i = s.todos.findIndex((x) => x.$cid === t.$cid);
                  // Text delta will be calculated automatically
                  if (i !== -1) s.todos[i].text = e.target.value;
                })
              }
            />
            
          
        ))}
      
    
  );
}

Undo/Redo


// Inside the same component, after creating `doc`:
const undo = useMemo(() => new UndoManager(doc), [doc]);

// Add controls anywhere in your UI:

  
  
  {/* UndoManager only reverts your local edits; remote edits stay. */}
  {/* See docs:  */}
  {/* For full time travel, see:  */}
;

What you get

Type-safe, framework-agnostic state
Each mutation becomes a minimal change-set (CRDT delta)—no manual diffing
Fine-grained updates to subscribers for fast, predictable renders
Built-in history and time travel
Offline-first sync via updates or snapshots with deterministic conflict resolution over any transport (HTTP, WebSocket, P2P)
Collaborative undo/redo across clients

We built a example PWA app here https://todo.loro.dev . It’s open source at https://github.com/loro-dev/loro-todo. It’s collaborative and account-free. The data will be persisted locally in IndexedDB and saved in the cloud for 7 days. You can share your todo list with others by just sharing the unique URL. In the codebase, only a tiny portion of the code is about Loro thanks to the help of loro-mirror.

Where we’re going

Because Mirror owns the bidirectional mapping between application state and the Loro document, we can move value up the stack while lowering integration cost. For example:

Text. Many interfaces render by lines, yet LoroText’s low‑level API is index‑based. Teams typically re‑implement line segmentation and map edits back to lines by hand. With Mirror in the middle, it becomes feasible to surface optional line‑aware events on top of LoroText so the UI receives stable, line‑based diffs without custom conversion—while retaining the underlying CRDT guarantees.
Tree. LoroTree CRDT already ensures correct concurrent moves, but developers still translate tree operations into application‑state patches. Mirror carries first‑class mappings from tree events into your state shape, so consumers can work with natural “insert/move/delete node” updates.
Ephemeral patches. We'll add setStateWithEphemeralPatch so Mirror can stream temporary drag or scale interactions through an EphemeralStore, letting collaborators see live previews while the persisted history stays clean and deduplicated once the change finalizes.

By using loro-mirror to bridge CRDTs and application state consistency, and by expressing schemas declaratively, we can let AI help developers get more done correctly. This makes Loro not only suitable for professional creative tools with real-time collaboration, but also for enabling people to build practical mini-tools for themselves and their communities.

If this work helps you build collaborative, local‑first experiences, we’d be grateful for your sponsorship. You can support us via GitHub Sponsors.

Loro 1.0

Wed, 23 Oct 2024 00:00:00 GMT

Loro 1.0

Loro is a Conflict-free Replicated Data Type (CRDT) library that developers can use to implement real-time collaboration and version control in their applications. You can use Loro to create local-first software. Loro 1.0 has a stable data format, excellent performance, and rich features. You can use it in Rust, JS (via WASM), and Swift.

What is CRDT? What is it used for?

Movable tree CRDTs and Loro's implementation

Thu, 18 Jul 2024 00:00:00 GMT

Movable tree CRDTs and Loro's implementation

This article introduces the implementation difficulties and challenges of Movable Tree CRDTs when collaboration, and how Loro implements it and sorts child nodes. The algorithm has high performance and can be used in production.

Background

In distributed systems and collaborative software, managing hierarchical relationships is difficult and complex. Challenges arise in resolving conflicts and meeting user expectations when working with the data structure that models movement by combining deletion and insertion. For instance, if a node is concurrently moved to different parents in replicas, it may lead to the unintended creation of duplicate nodes with the same content. Because the node is deleted twice and created under two parents.

Currently, many software solutions offer different levels of support and functionality for managing hierarchical data structures in distributed environments. The key variation among these solutions lies in their approaches to handling potential conflicts.

Conflicts in Movable Trees

A movable tree has 3 primary operations: creation, deletion, and movement. Consider a scenario where two peers independently execute various operations on their respective replicas of the same movable tree. Synchronizing these operations can lead to potential conflicts, such as:

The same node was deleted and moved
The same node was moved under different nodes
Different nodes were moved, resulting in a cycle
The ancestor node is deleted while the descendant node is moved

Deletion and Movement of the Same Node

This situation is relatively easy to resolve. It can be addressed by applying one of the operations while ignoring the other based on the timestamp in the distributed system or the application's specific requirements. Either approach yields an acceptable outcome.

Moving the Same Node Under Different Parents

Merging concurrent movement operations of the same node is slightly more complex. Different approaches can be adopted depending on the application:

Delete the node and create copies of nodes under different parent nodes. Subsequent operations then treat these nodes independently. This approach is acceptable when node uniqueness is not critical.
Allow the node have two edges pointing to different parents. However, this approach breaks the fundamental tree structure and is generally not considered acceptable.
Sort all operations, then apply them one by one. The order can be determined by timestamps in a distributed system. Providing the system maintains a consistent operation sequence, it ensures uniform results across all peers.

Movement of Different Nodes Resulting in a Cycle

Concurrent movement operations that cause cycles make the conflict resolution of movable trees complex. Matthew Weidner listed several solutions to resolve cycles in his blog.

Error. Some desktop file sync apps do this in practice (Martin Kleppmann et al. (2022) give an example).

Render the cycle nodes (and their descendants) in a special “time-out” zone. They will stay there until some user manually fixes the cycle.

Use a server to process move ops. When the server receives an op, if it would create a cycle in the server’s own state, the server rejects it and tells users to do likewise. This is what Figma does. Users can still process move ops optimistically, but they are tentative until confirmed by the server. (Optimistic updates can cause temporary cycles for users; in that case, Figma uses strategy (2): it hides the cycle nodes.)

Similar, but use a topological sort (below) instead of a server’s receipt order. When processing ops in the sort order, if an op would create a cycle, skip it (Martin Kleppmann et al. 2022).

For forests: Within each cycle, let B.parent = A be the edge whose set operation has the largest LWW timestamp. At render time, “hide” that edge, instead rendering B.parent = "none", but don’t change the actual CRDT state. This hides one of the concurrent edges that created the cycle. • To prevent future surprises, users’ apps should follow the rule: before performing any operation that would create or destroy a cycle involving a hidden edge, first “affirm” that hidden edge, by performing an op that sets B.parent = "none".

For trees: Similar, except instead of rendering B.parent = "none", render the previous parent for B - as if the bad operation never happened. More generally, you might have to backtrack several operations. Both Hall et al. (2018) and Nair et al. (2022) describe strategies along these lines.

Ancestor Node Deletion and Descendant Node Movement

The most easily overlooked scenario is moving descendant nodes when deleting an ancestor node. If all descendant nodes of the ancestor are deleted directly, users may easily misunderstand that their data has been lost.

How Popular Applications Handle Conflicts

Dropbox is a file data synchronization software. Initially, Dropbox treated file movement as a two-step process: deletion from the original location followed by creation at a new location. However, this method risked data loss, especially if a power outage or system crash occurred between the delete and create operations.

Today, when multiple people move the same file concurrently and attempt to save their changes, Dropbox detects a conflict. In this scenario, it typically saves one version of the original file and creates a new "conflicted copy" for the changes made by one of the users.

The image shows the conflict that occurs when A is moved to the B folder and B is moved to the A folder concurrently.

Figma is a real-time collaborative prototyping tool. They consider tree structures as the most complex part of the collaborative system, as detailed in their blog post about multiplayer technology. To maintain consistency, each element in Figma has a "parent" attribute. The centralized server plays a crucial role in ensuring the integrity of these structures. It monitors updates from various users and checks if any operation would result in a cycle. If a potential cycle is detected, the server rejects the operation.

However, due to network delays and similar issues, there can be instances where updates from users temporarily create a cycle before the server has the chance to reject them. Figma acknowledges that this situation is uncommon. Their solution is straightforward yet effective: they temporarily preserve this state and hide the elements involved in the cycle. This approach lasts until the server formally rejects the operation, ensuring both the stability of the system and a seamless user experience.

![An animation that demonstrates how Figma resolves conflicts.](./movable-tree/figma-tree.gif)

An animation that demonstrates how Figma resolves conflicts.

Movable Tree CRDTs

The applications mentioned above use movable trees and resolve conflicts based on centralized solutions. Another alternative approach to collaborative tree structures is using Conflict-free Replicated Data Types (CRDTs). While initial CRDT-based algorithms were challenging to implement and incurred significant storage overhead as noted in prior research, such as Abstract unordered and ordered trees CRDT or File system on CRDT, but continual optimization and improvement have made several CRDT-based tree synchronization algorithms suitable for certain production environments. This article highlights two innovative CRDT-based approaches for movable trees. The first is presented by Martin Kleppmann et al. in their work A highly-available move operation for replicated trees and the second by Evan Wallace in his CRDT: Mutable Tree Hierarchy.

A highly-available move operation for replicated trees

This paper unifies the three operations used in trees (creating, deleting, and moving nodes) into a move operation. The move operation is defined as a four-tuple Move t p m c, where t is the operation's unique and ordered timestamp such as Lamport timestamp, p is the parent node ID, m is the metadata associated with the node, and c is the child node ID.

If all nodes of the tree do not contain c, this is a creation operation that creates a child node c under parent node p. Otherwise, it is a move operation that moves c from its original parent to the new parent p. Additionally, node deletion is elegantly handled by introducing a designated TRASH node; moving a node to TRASH implies its deletion, with all descendants of TRASH considered deleted. But they remain in memory to prevent concurrent editing from moving them to other nodes. In order to handle the previously mentioned situation of deleting ancestor nodes and moving descendant nodes concurrently.

In the three potential conflicts mentioned earlier, since deletion is also defined as a move operation, deleting and moving the same node is transformed into two move operations, leaving only two remaining problems:

Moving the same node under different parents
Moving different nodes, creating a cycle

Logical timestamps are added so that all operations can be linearly ordered, thus the first conflict can be avoided as they can be expressed as two operations in sequence rather than concurrently for the same node. Therefore, in modeling a Tree using only move operations, the only exceptional case in concurrent editing would be creating a cycle, and operations causing a cycle are termed unsafe operations.

This algorithm sorts all move operations according to their timestamps. It can then sequentially apply each operation. Before applying, the algorithm detects cycles to determine whether an operation is safe. If the operation creates a cycle, we ignore the unsafe operation to ensure the correct structure of the tree.

Based on the above approach, the consistency problem of movable trees becomes the following two questions:

How to introduce global order to operations
How to apply a remote operation that should be inserted in the middle of an existing sorted sequence of operations

Globally Ordered Logical Timestamps

Lamport Timestamp can determine the causal order of events in a distributed system. Here's how they work: each peer starts with a counter initialized to 0. When a local event occurs, the counter is increased by 1, and this value becomes the event's Lamport Timestamp. When peer A sends a message to peer B, A attaches its Lamport Timestamp to the message. Upon receiving the message, peer B compares its current logical clock value with the timestamp in the message and updates its logical clock to the larger value.

To globally sort events, we first look at the Lamport Timestamps: smaller numbers mean earlier events. If two events have the same timestamp, we use the unique ID of the peer serves as a tiebreaker.

Apply a Remote Operation

An op's safety depends on the tree's state when applied, avoiding cycles. Insertion requires evaluating the state formed by all preceding ops. For remote updates, we may need to:

Undo recent ops
Insert the new op
Reapply undone ops

This ensures proper integration of new ops into the existing sequence.

Undo Recent Ops

Since we've modeled all operations on the tree as move operations, undoing a move operation involves either moving the node back to its old parent or undoing the operation that created this node. To enable quick undoing, we cache and record the old parent of the node before applying each move operation.

Apply the Remote Op

Upon encountering an unsafe operation, disregarding its effects prevents the creation of a cycle. Nevertheless, it's essential to record the operation, as the safety of an operation is determined dynamically. For instance, if we receive and sort an update that deletes another node causing the cycle prior to this operation, the operation that was initially unsafe becomes safe. Additionally, we need to mark this unsafe operation as ineffective, since during undo operations, it's necessary to query the old parent node, which is the target parent of the last effective operation in the sequence targeting this node.

Reapply Undone Ops

Cycles only occur when receiving updates from other peers, so the undo-do-redo process is also needed at this time. When receiving a new op:

function apply(newOp)
      // Compare the ID of the new operation with existing operations
      if largerThanExistingOpId(newOp.id, oplog)
          // If the new operation's ID is greater, apply it directly
          oplog.applyOp(newOp)
      else
          // If the new operation's ID is not the greatest, undo operations until it can be applied
          undoneOps = oplog.undoUtilCanBeApplied(newOp)
          oplog.applyOp(newOp)
          // After applying the new operation, redo the undone operations to maintain sequence order
          oplog.redoOps(undoneOps)

If the new operation depends on an op that has not been encountered locally, indicating that some inter-version updates are still missing, it is necessary to temporarily cache the new op and wait to apply it until the missing updates are received.
Compare the new operation with all existing operations. If the opId of the new operation is greater than that of all existing operations, it can be directly applied. If the new operation is safe, record the parent node of the target node as the old parent node, then apply the move operation to change the current state. If it is not safe, mark this operation as ineffective and ignore the operation's impact.
If the new opId is sorted in the middle of the existing sequence, it is necessary to pop the operations that are sorted later from the sequence one by one, and undo the impact of this operation, which means moving back to the child of the old parent node, until the new operation can be applied. After applying the new operation, reapply the undone nodes in sequence order, ensuring that all operations are applied in order.

The following animated GIF demonstrates the process executed by Peer1:

Received Peer0 creating node A with the root node as its parent.
Received Peer0 creating node B with A as its parent.
Created node C with A as its parent and synchronized it with Peer0.
Moved C to have B as its parent.
Received Peer0's moving B to have C as its parent.

![](./movable-tree/undo-do-redo.gif)

The queue at the top right of the animation represents the order of local operations and newly received updates. The interpretation of each element in each Block is as follows:

![](./movable-tree/explain.png)

A particular part of this process to note is the two operations with lamport timestamps of 0:3 and 1:3. Initially, the 1:3 operation moving C to B was created and applied locally, followed by receiving Peer0's 0:3 operation moving B to C. In lamport timestamp order, 0:3 is less than 1:3 but greater than 1:2 (with peer as the tiebreaker when counters are equal). To apply the new op, the 1:3 operation is undone first, moving C back to its old parent A, then 0:3 moving B to C is applied. After that, 1:3 is redone, attempting to move C to B again (the old parent remains A, omitted in the animation). However, a cycle is detected during this attempt, preventing the operation from taking effect, and the state of the tree remains unchanged. This completes an undo-do-redo process.

CRDT: Mutable Tree Hierarchy

Evan Wallace has developed an innovative algorithm that enables each node to track all its historical parent nodes, attaching a counter to each recorded parent. The count value of a new parent node is 1 higher than that of all the node's historical parents, indicating the update sequence of the node's parents. The parent with the highest count is considered the current parent node.

During synchronization, this parent node information is also synced. If a cycle occurs, a heuristic algorithm reattaches the nodes causing the cycle back to the nearest historical parent node that won't cause a cycle and is connected to the root node, thus updating the parent node record. This process is repeated until all nodes causing cycles are reattached to the tree, achieving all replica synchronization of the tree structure. The demo in Evan's blog clearly illustrates this process.

As Evan summarized at the end of the article, this algorithm does not require the expensive undo-do-redo process. However, each time a remote move is received, the algorithm needs to determine if all nodes are connected to the root node and reattach the nodes causing cycles back to the tree, which can perform poorly when there are too many nodes.

I established a benchmark to compare the performance of the movable tree algorithms.

Movable Tree CRDTs implementation in Loro

Loro implements the algorithm proposed by Martin Kleppmann et al., A highly-available move operation for replicated trees. On one hand, this algorithm has high performance in most real world scenarios. On the other hand, the core undo-do-redo process of the algorithm is highly similar to how Eg-walker (Event Graph Walker) applies remote updates in Loro. Introduction about Eg-walker can be found in our previous blog.

Movable tree has been introduced in detail, but there is still another problem of tree structure that has not been solved. For movable tree, in some real use cases, we still need the capability to sort child nodes. This is necessary for outline notes or layer management in graphic design softwares. Users need to adjust node order and sync it to other collaborators or devices.

We integrated the Fractional Index algorithm into Loro and combined it with the movable tree, making the child nodes of the movable tree sortable.

There are many introductions to Fractional Index on the web, You can read more about Fractional Index in the Figma blog or Evan blog. In simple terms, Fractional Index assigns a sortable value to each object, and if a new insertion occurs between two objects, the Fractional Index of the new object will be between the left and right values. What we want to speak about more here is how to deal with potential conflicts brought by Fractional Index in CRDTs systems.

Potential Conflicts in Child Node Sorting

As our applications are in a distributive condition, when multiple peers insert new nodes in the same position, the same Fractional Index would be assigned to these differing content but same position nodes. When updates from the remote are applied to local, conflicts arise as the same Fractional Index is encountered.

In Loro, we retain these identical Fractional Index and use PeerID (unique ID of every Peer) as the tie-breaker for the relative order judgment of the same Fractional Index.

Although this solved the sorting problem among the same Fractional Index nodes from different peers, it impacted the generation of new Fractional Index as we cannot generate a new Fractional Index between two same ones. We use two methods to solve this problem:

The first method, as stated in Evan's blog, we could add a certain amount of jitter to each generated Fractional Index, (for the ease of explanation, all examples below take decimal fraction as the Fractional Index) for example, when generating a new Fractional Index between 0 and 1, it should have been 0.5, but through random jitters, it could be 0.52712, 0.58312, 0.52834, etc., thus significantly reducing the chance of same Fractional Index appearing.
If the situation arises where the same Fractional Index is present on both sides, we can handle this problem by resetting these Fractional Index. For example, if we need to insert a new node between 0.7@A and 0.7@B (which indicates Fractional Index @ PeerID), instead of generating a new Fractional Index between 0.7 and 0.7, we could assign two new Fractional Index respectively for the new node and the 0.7@B node between 0.7 and 1, which could be understood as an extra move operations.

Implementation and Encoding Size

Introducing Fractional Index brings the advantage of node sequence. What about encoding size?

Loro uses drifting-in-space Fractional Index implementation based on Vec, which is base 256. In other words, you need to continuously insert 128 values forward or backward from the default value to increase the byte size of the Fractional Index by 1. The worst storage overhead case, such as inserting new values alternately each time. For example, the initial sequence is ab, insert c between a and b, then insert d between c and b, then e between c and d, like:

ab    // [128] [129, 128]
acb   // [128] [129, 127, 128] [129, 128]
acdb  // [128] [129, 127, 128] [129, 127, 129, 128] [129, 128]
acedb // [128] [129, 127, 128] [129, 127, 129, 127, 128] [129, 127, 129, 128] [129, 128]

a new operation would cause an additional byte to be needed. But such a situation is very rare.

Considering that potential conflicts wouldn't appear frequently in most applications, Loro simply extended the implementation, the original implementation produced new Fractional Index in Vec by only increasing or decreasing 1 in certain index to achieve relative sorting. The simple jitter solution was added, by appending random bytes in length of jitter value to Fractional Index. To enable jitter in js, you can use doc.setFractionalIndexJitter(number) with a positive value. But this will increase the encoding size slightly, but each Fractional Index only adds jitter bytes. If you want to generate Fractional Index at the same position with 99% probability without conflict, the relationship between jitter settings and the maximum number of concurrent edits n will be:

jitter	max num of concurrent edits
1	3
2	37
3	582

When there are numerous Fractional Indexes, there will be many common prefixes after being sorted, when Loro encodes these Fractional Indexes, prefix optimization would be implemented. Each Fractional Index only saves the amount of same prefix bits and remaining bytes with the previous one, which further downsizes the overall encoding size.

Related work

Other than using Fractional Index, there are other movable list CRDT that can make sibling nodes of the tree in order. One of these algorithms is Martin Kleppmann's Moving Elements in List CRDTs, which has been used in Loro's Movable List.

In comparison, the implementation of Fractional Index solution is simpler, and no stable position representation is provided for child nodes when modeling nodes in a tree, otherwise, the overall tree structure would be too complex. However, the Fractional Index has the problem of interleaving, but this is acceptable when some only need relative order and do not require strict sequential semantics, such as figma layer items, multi-level bookmarks, etc.

Benchmark

We conducted performance benchmarks on the Movable Tree implementation by Loro, including scenarios of random node movement, switching to historical versions, and performance under extreme conditions with significantly deep tree structures. The results indicate that it is capable of supporting real-time collaboration and enabling seamless historical version checkouts.

Task	Time	Setup
Move 10000 times randomly	28 ms	Create 1000 nodes first
Switch to different versions 1000 times	153 ms	Create 1000 nodes and move 1000 times first
Switch to different versions 1000 times in a tree with depth of 300	701 ms	The new node is a child node of the previous node

Test environment: M2 Max CPU, you can find the bench code here.

Usage


let doc = new Loro();
let tree: LoroTree = doc.getTree("tree");
let root: LoroTreeNode = tree.createNode();
// By default, append to the end of the parent node's children list
let node = root.createNode();
// Specify the child's position
let node2 = root.createNode(0);
// Move `node2` to be the last child of `node`
node2.move(node);
// Move `node` to be the first child of `node2`
node.move(node2, 0);
// Move the node to become the root node
node.move();
// Move the node to be positioned after another node
node.moveAfter(node2);
// Move the node to be positioned before another node
node.moveBefore(node2);
// Retrieve the index of the node within its parent's children
let index = node.index();
// Get the `Fractional Index` of the node
let fractionalIndex = node.fractionalIndex();
// Access the associated data map container
let nodeData: LoroMap = node.data;

Demo

We developed a simulated Todo app with data synchronization among multiple peers using Loro, including the use of Movable Tree to represent subtask relationships, Map to represent various attributes of tasks, and Text to represent task titles, etc. In addition to basic creation, moving, modification, and deletion, we also implemented version switching based on Loro. You can drag the scrollbar to switch between all the historical versions that have been operated on.

Summary

This article discusses why implementing Movable Tree CRDTs is difficult, and presents two innovative algorithms for movable trees.

For implementation, Loro has integrated A highly-available move operation for replicated trees to implement the hierarchical movement of the Tree, and integrated the Fractional Index implementation by drifting-in-space to achieve the movement between child nodes. This can meet the needs of various application scenarios.

If you are developing collaborative applications or are interested in CRDT algorithms, you are welcome to join our community.

Introduction to Loro's Rich Text CRDT

Mon, 22 Jan 2024 00:00:00 GMT

Introduction to Loro's Rich Text CRDT

This article presents the rich text CRDT algorithm implemented in Loro, complying with Peritext's criteria for seamless rich text collaboration. Furthermore, it can be built on top of any List CRDT algorithms and turn them into rich text CRDTs.

Above is an online demo of Loro's rich text CRDT, built with Quill. After the replay, you can simulate real-time collaboration and concurrent editing while offline. You can also drag on the history view to replay the editing history.

If CRDTs are new to you, our article What are CRDTs provides a brief introduction.

Background

Loro is based on the Event Graph Walker (Eg-walker) algorithm proposed by Joseph Gentle, but this algorithm cannot integrate the original version of Peritext. This motivates us to create a new rich text algorithm. It is independent of the specific List CRDTs, thus working nicely with Eg-walker, and is developed on top of them to establish a rich text CRDT.

Before diving into the algorithm of Loro's rich text CRDT, I'd like to briefly introduce Eg-walker and Peritext, and why Peritext cannot be used on Eg-walker.

Recap on List CRDTs

Loro: Reimagine State Management with CRDTs

Mon, 13 Nov 2023 00:00:00 GMT

Loro: Reimagine State Management with CRDTs

Loro, our high-performance CRDTs library, is now open source

In this article, we share our vision for the local-first software development paradigm, explain why we're excited about it, and discuss the current status of Loro.

With better DevTools, documentation, and a friendly ecosystem, everyone can easily build local-first software.

You can build collaborative apps with time travel features easily using Loro. Play the example online.

Envisioning the Local-First Development Paradigm

Distributed states are commonly found in numerous scenarios, such as multiplayer games, multi-device document synchronization, and edge networks. These scenarios require synchronization to achieve consistency, usually entailing elaborate design and coding. For instance, considerations for network issues or concurrent write operations are necessary. However, for a wide range of applications CRDTs can simplify the code significantly:

CRDTs can automatically merge concurrent writes without conflicts.
Fewer abstractions. There's no need to design specific backend database schemas, manually execute expected conflict merges, or implement interfaces to memory and memory to persistent structure conversions.
Offline supports are right out of the box

What are CRDTs

crdt-richtext - Rust implementation of Peritext and Fugue

Thu, 20 Apr 2023 00:00:00 GMT

crdt-richtext: Rust implementation of Peritext and Fugue

Presenting a new Rust crate that combines Peritext and Fugue's power with impressive performance, tailored specifically for rich text. This crate's functionality is set to be incorporated into Loro, a general-purpose CRDT library currently under development.

What’s Peritext

Peritext: A CRDT for Rich-Text Collaboration

Peritext is a novel rich-text CRDT (Conflict-free Replicated Data Type) algorithm. It is capable of merging concurrent edits in rich text format while preserving users' intent as much as possible. Its primary focus is on merging the formats and annotations of rich text content, such as bold, italic, and comments.

💡 The specific definition of user intent in the context of concurrent rich text editing can't be clearly explained in a few words. it's best understood through specific examples.

Peritext is designed to solve a couple of significant challenges:

Firstly, it addresses the anticipated problems arising from conflicting style edits. For instance, consider a text example, "The quick fox jumped." If User A highlights "The quick" in bold and User B highlights "quick fox jumped," the ideal merge should result in the entire sentence, "The quick fox jumped," being bold. However, existing algorithms might not meet this expectation, resulting in either "The quick fox" or "The" and "jumped" being bold instead.

Original Text	The quick fox jumped
Concurrent Edit from A	The quick fox jumped
Concurrent Edit from B	The quick fox jumped
Expected Merged Result	The quick fox jumped
Bad case from merging Markdown text directly	The quick fox jumped
Bad case from Yjs	The quick fox jumped

Additionally, Peritext manages conflicts between style and text edits. In the same example, if User A highlights "The quick" in bold, but User B changes the text to "The fast fox jumped," the ideal merge should result in "The fast" being bold.

Original Text	The quick fox jumped
Concurrent Edit from A	The quick fox jumped
Concurrent Edit from B	The fast fox jumped
Expected Merged Result	The fast fox jumped

What’s more, Peritext takes into account different expectations for expanding styles. For example, if you type after a bold text, you would typically want the new text to continue being bold. However, if you're typing after a hyperlink or a comment, you likely wouldn't want the new input to become part of the hyperlink or comment.

What’s Fugue

Fugue is a new CRDT text algorithm, presented in The Art of the Fugue: Minimizing Interleaving in Collaborative Text Editing by Matthew Weidner et al., nicely solves the interleaving problem.

The interleaving problem

The interleaving problem was proposed in the paper Interleaving anomalies in collaborative text editors by Martin Kleppmann et al.

An example of interleaving:

A type "Hello " from left to right/right to left
B type "Hi " from left to right/right to left
The expected result: "Hello Hi " or "Hi Hello "
The interleaving result may look like: "HHeil lo"
- This happens when typing from right to left in RGA.

An example of an interleaving anomaly when using fractional indexing CRDT on text content. Source: **Martin Kleppmann, Victor B. F. Gomes, Dominic P. Mulligan, and Alastair R. Beresford. 2019. Interleaving anomalies in collaborative text editors. https://doi.org/10.1145/3301419.3323972

The Fugue paper summarizes the current state of the interleaving problems in the table.

Source: Weidner, M., Gentle, J., & Kleppmann, M. (2023). The Art of the Fugue: Minimizing Interleaving in Collaborative Text Editing. ArXiv. /abs/2305.00583

The interleaving problem sometimes are unsolvable when there are more than 2 sites. See Fugue paper Appendix B, Proof of Theorem 5 for detailed explanation.

The case where the interleaving problem is unsolvable Source: Weidner, M., Gentle, J., & Kleppmann, M. (2023). The Art of the Fugue: Minimizing Interleaving in Collaborative Text Editing. ArXiv. /abs/2305.00583

However, we can still minimize the chance of interleaving. Fugue introduces the concept of maximal non-interleaving and solves it with an elegant algorithm that is easy to optimize. The definition of maximal non-interleaving makes a lot of sense to me and leaves little room for ambiguity. I won't reiterate the definition here. But the basic idea is first to solve forward interleaving by leftOrigin. If there is still ambiguity, then solve the backward interleaving by rightOrigin. (The leftOrigin and rightOrigin refer to the ids of the original neighbors when the character is inserted, just like Yjs)

CRDT-Richtext

Based on the algorithms of Peritext and Fugue, we made crdt-richtext, a lib written in Rust that provides a wasm interface. It’s available on crates.io and npm now.

Example


const text = new RichText(BigInt(1));
text.insert(0, "你好，世界！");
text.insert(2, "呀");
expect(text.toString()).toBe("你好呀，世界！");
text.annotate(0, 3, "bold", AnnotateType.BoldLike);
const spans = text.getAnnSpans();
expect(spans.length).toBe(2);
expect(spans[0].text).toBe("你好呀");
expect(spans[0].annotations.size).toBe(1);
expect(spans[0].annotations.has("bold")).toBeTruthy();
expect(spans[1].text.length).toBe(4);

const b = new RichText(BigInt(2));
b.import(text.export(new Uint8Array()));
expect(b.toString()).toBe("你好呀，世界！");

Data structure

We heavily use B-Trees to optimize our algorithm. We made a library called generic-btree, which is written in safe Rust code, which provides a flexible foundation for our optimization efforts.

https://github.com/loro-dev/generic-btree

The cached content inside B-Tree

There are several common tasks we need to address in Text CRDT, including:

Finding, inserting, or deleting content at a given index:
- We use a BTree to look up and update the content
- The time complexity is O(logN), where N is the length of the content
Finding content with a given op ID:
- We use a combination of HashMap and BTree
- The time complexity if O(logN), where N is the number of operations
Compressing content in memory:
- To reduce the amount of memory used by storing every operation in raw format, we compress the content using the RLE tricks from Yjs and DiamondTypes.
  - The insight behind this compression is that neighboring inserts and deletions tend to be continuous, so we can merge them and store less metadata.
- Commonly, every leaf node in the diagram contains a dozen of characters
Converting index between UTF-16 and UTF-8:
- In JS, the default encoding of a string is utf16, but in Rust, the default one is utf8. Although the WASM interface can help us convert the encoding of the string, we still need to convert the index of the operation.
- To solve this, crdt-richtext also store the UTF-16 length of the content in B-Tree. So we can query the B-Tree with either the utf8 index or the utf16 index.
Storing the boundary of style/format/comments:
- We use the same B-Tree to store the boundary, with each subtree corresponding to a span of text or tombstones. For each node in the tree, we store which annotations start before it, start after it, end before it, or end after it.
```
#[derive(Debug, PartialEq, Eq, Default, Clone)]
pub struct ElemAnchorSet {
    start_before: FxHashSet,
    end_before: FxHashSet,
    start_after: FxHashSet,
    end_after: FxHashSet,
}
```
- This is basically the same optimization as Peritext, except we do it on the tree.

Encoding

We use columnar encoding, which was first adopted to CRDTs by Martin Kelppmann in automerge. To make it easier in Rust, we created the lib Serde Columnar: Ergonomic columnar storage encoding crate.

Heavily tested by libFuzzer

Test-Driven Development (TDD) provides an amazing development experience. If possible, I always write unit tests for a standalone module before moving forward. However, for algorithms like CRDTs, it is infeasible to list all possible cases manually but is easy to generate test cases automatically. This is where fuzzing tests come into play.

Some fuzzers can track coverage information and generate mutations on the input data to maximize code coverage. LibFuzzer can also identify memory leaks and UAF problems.

[cargo-fuzz](https://www.notion.so/crdt-richtext-Rust-implementation-of-Peritext-and-Fugue-c49ef2a411c0404196170ac8daf066c0?pvs=21) provides a user-friendly API for writing fuzzing tests, and it supports two fuzzers: libFuzzer and AFL. It makes the unstructured libFuzzer feel structured. So we’re able to write fuzzing tests in this way

use arbitrary::Arbitrary;

#[derive(Arbitrary, Clone, Debug, Copy)]
pub enum Action {
    Insert {
        actor: u8,
        pos: u8,
        content: u16,
    },
    Delete {
        actor: u8,
        pos: u8,
        len: u8,
    },
    Annotate {
        actor: u8,
        pos: u8,
        len: u8,
        annotation: AnnotationType,
    },
    Sync(u8, u8),
}

pub fn fuzzing(actions: Vec) {
	// run tests based on actions
	...
}

#![no_main]
use libfuzzer_sys::fuzz_target;

fuzz_target!(|actions: [Action; 100]| { fuzzing(actions.to_vec()) });

We will run millions of Fuzzing Tests after making big changes. The fuzzer can help us extract the most useful thousands of tests to be included into the corpus. The minor changes can be verified by running the corpus.

We use fuzzing tests in Loro's CRDTs too. This test suite is like our safety net when we're making big tweaks to the code. It's great at spotting all our little slip-ups.

Performance

Benchmark

Benchmark setup
B4: Real-world editing dataset
Replay a real-world editing dataset. This dataset contains the character-by-character editing trace of a large-ish text document, the LaTeX source of this paper: https://arxiv.org/abs/1608.03960(opens in a new tab) Source: https://github.com/automerge/automerge-perf/tree/master/edit-by-index(opens in a new tab)
- 182,315 single-character insertion operations
- 77,463 single-character deletion operations
- 259,778 operations totally
- 104,852 characters in the final document We simulate one client replaying all changes and storing each update. We measure the time to replay the changes and the size of all update messages (updateSize), the size of the encoded document after the task is performed (docSize), the time to encode the document (encodeTime), the time to parse the encoded document (parseTime), and the memory used to hold the decoded document in memory (memUsed).
[B4 x 100] Real-world editing dataset 100 times

Replay the [B4] dataset one hundred times. The final document has a size of over 10 million characters. As comparison, the book "Game of Thrones: A Song of Ice and Fire" is only 1.6 million characters long (including whitespace).
- 18,231,500 single-character insertion operations
- 7,746,300 single-character deletion operations
- 25,977,800 operations totally
- 10,485,200 characters in the final document

The benchmark was conducted on a 2020 M1 MacBook Pro 13-inch on 2023-05-11.

N=6000	crdt-richtext-wasm	loro-wasm	automerge-wasm	tree-fugue	yjs	ywasm
[B4] Apply real-world editing dataset (time)	176 +/- 10 ms	141 +/- 15 ms	821 +/- 7 ms	721 +/- 15 ms	1,114 +/- 33 ms	23,419 +/- 102 ms
[B4] Apply real-world editing dataset (memUsed)	skipped	skipped	skipped	2,373,909 +/- 13725 bytes	3,480,708 +/- 168887 bytes	skipped
[B4] Apply real-world editing dataset (encodeTime)	8 +/- 1 ms	8 +/- 1 ms	115 +/- 2 ms	12 +/- 0 ms	12 +/- 1 ms	6 +/- 1 ms
[B4] Apply real-world editing dataset (docSize)	127,639 +/- 0 bytes	255,603 +/- 8 bytes	129,093 +/- 0 bytes	167,873 +/- 0 bytes	159,929 +/- 0 bytes	159,929 +/- 0 bytes
[B4] Apply real-world editing dataset (parseTime)	11 +/- 0 ms	2 +/- 0 ms	620 +/- 5 ms	8 +/- 0 ms	43 +/- 3 ms	40 +/- 3 ms
[B4x100] Apply real-world editing dataset 100 times (time)	15,324 +/- 3188 ms	12,436 +/- 444 ms	skipped	91,902 +/- 863 ms	112,563 +/- 3861 ms	skipped
[B4x100] Apply real-world editing dataset 100 times (memUsed)	skipped	skipped	skipped	224076566 +/- 2812359 bytes	318807378 +/- 15737245 bytes	skipped
[B4x100] Apply real-world editing dataset 100 times (encodeTime)	769 +/- 37 ms	780 +/- 32 ms	skipped	943 +/- 52 ms	297 +/- 16 ms	skipped
[B4x100] Apply real-world editing dataset 100 times (docSize)	12,667,753 +/- 0 bytes	26,634,606 +/- 80 bytes	skipped	17,844,936 +/- 0 bytes	15,989,245 +/- 0 bytes	skipped
[B4x100] Apply real-world editing dataset 100 times (parseTime)	1,252 +/- 14 ms	170 +/- 15 ms	skipped	368 +/- 13 ms	1,335 +/- 238 ms	skipped

The complete benchmark result and code is available at https://github.com/https://twitter.com/zx_loro/fugue-bench.

It is worth noting that:

The benchmark for Automerge is based on automerge-wasm, which is not the latest version of Automerge 2.0.
crdt-richtext and fugue are special-purpose CRDTs that tend to be faster and have a smaller encoding size.
The encoding of yjs, ywasm, and loro-wasm still contains redundancy that can be compressed significantly. For more details, see the full report.
loro-wasm and fugue only support plain text for now

Discussion

CRDT-richtext: Rust implementation of Peritext and Fugue | Hacker News

Loro Blog

Mergeable Containers: Fixing Concurrent Child Creation

Why This Happens

Why Root Containers Are Naturally Mergeable

API: Explicitly Ensuring a Mergeable Child

Core Design: Deterministic CID + Map Slot Marker

1. CID: A Synthetic Root Container ID

Loro Protocol

Loro Protocol

Quick Start: Server & Client Example

Features

Multiplexing

Compatibility

Experimental E2E Encryption

Status and Licensing

Loro Mirror: Make UI State Collaborative by Mirroring to CRDTs

Loro Mirror: Make UI State Collaborative by Mirroring to CRDTs

Overview

Why this exists

What Mirror provides

How to use

Basic Example

React Example

Where we’re going

Loro 1.0

Movable tree CRDTs and Loro's implementation

Movable tree CRDTs and Loro's implementation

Background

Conflicts in Movable Trees

Deletion and Movement of the Same Node

Moving the Same Node Under Different Parents

Movement of Different Nodes Resulting in a Cycle

Ancestor Node Deletion and Descendant Node Movement

How Popular Applications Handle Conflicts

Movable Tree CRDTs

A highly-available move operation for replicated trees

Globally Ordered Logical Timestamps

Apply a Remote Operation

Undo Recent Ops

Apply the Remote Op

Reapply Undone Ops

CRDT: Mutable Tree Hierarchy

Movable Tree CRDTs implementation in Loro

Potential Conflicts in Child Node Sorting

Implementation and Encoding Size

Related work

Benchmark

Usage

Demo

Summary

Introduction to Loro's Rich Text CRDT

Background

Loro: Reimagine State Management with CRDTs

Envisioning the Local-First Development Paradigm

crdt-richtext - Rust implementation of Peritext and Fugue

crdt-richtext: Rust implementation of Peritext and Fugue

What’s Peritext

What’s Fugue

The interleaving problem

CRDT-Richtext

Example

Data structure

Encoding

Heavily tested by libFuzzer

Performance

Benchmark

B4: Real-world editing dataset

[B4 x 100] Real-world editing dataset 100 times

Discussion