How to Use Cow<T> (Clone on Write) in Rust

When you want to avoid copying until you have to

You write a function that accepts a string, checks it for formatting issues, and returns the cleaned version. Most of the time, the input is already clean. Your function just reads it and hands it back. But sometimes the input has trailing whitespace or weird capitalization. When that happens, you need to mutate the string and return a new one.

If you accept String, the caller pays for an allocation even when they only wanted to pass a literal or a borrowed slice. If you accept &str, you cannot mutate the data without forcing the caller to clone it upfront. You are stuck between two bad choices: force unnecessary allocations, or force unnecessary clones.

Cow<T> solves this exact friction. The name stands for Clone on Write. It lets you accept borrowed data, read it freely, and only allocate a new owned copy when you actually need to change it. The compiler handles the branching logic. Your API stays flexible. Your runtime stays lean.

The concept in plain words

Cow is an enum with two variants. Borrowed holds a reference to existing data. Owned holds a freshly allocated copy. The type parameter T must implement the Borrow trait, which tells Rust how to treat the owned version as a borrowed one. For strings, that means Cow<str> can hold either &str or String. For byte slices, Cow<[u8]> can hold either &[u8] or Vec<u8>.

Think of a shared whiteboard in a conference room. Everyone can read the diagram drawn on it. No one needs to make a copy just to look at it. If someone needs to add their own notes, they grab a fresh whiteboard, trace the original diagram onto it, and start writing. The original board stays untouched for everyone else. Cow works the same way. It points to the original data until you call the method that triggers a clone. Until that moment, zero bytes are copied.

The lifetime parameter on Cow<'a, T> tracks how long the borrowed variant is valid. When you hold a Cow<'a, str>, the compiler knows that if it is currently Borrowed, the underlying &str lives at least as long as 'a. If it becomes Owned, the lifetime no longer matters because the String owns its own heap allocation. This dual nature is what makes Cow safe without requiring unsafe code.

A minimal example

use std::borrow::Cow;

/// Demonstrates the two states of Cow and the trigger for cloning.
fn main() {
    // Start with a borrowed slice. No heap allocation happens here.
    let mut data: Cow<str> = Cow::Borrowed("initial text");

    // Read the data directly. Cow implements Deref, so it behaves like &str.
    println!("Before mutation: {}", data);

    // Call to_mut() to request a mutable reference.
    // If data is Borrowed, Cow clones it into an Owned String.
    // If data is already Owned, to_mut() just returns a &mut String.
    let owned_slice = data.to_mut();

    // Now we hold a mutable reference to the owned heap allocation.
    owned_slice.push_str(" that was modified");

    // The Cow variant has switched from Borrowed to Owned.
    println!("After mutation: {}", data);
}

Run this code and you will see the exact moment the allocation happens. The first print statement reads directly from the string literal. The second print statement reads from a heap-allocated String. The transition is explicit, predictable, and controlled by your code.

What happens under the hood

When you create Cow::Borrowed("hello"), Rust stores a pointer and a length inside the enum. The discriminant points to the Borrowed branch. Memory usage is identical to a raw &str. No allocator is touched.

When you call to_mut(), the runtime checks the discriminant. If it sees Borrowed, it invokes the ToOwned trait implementation for the underlying type. For str, that means String::from(borrowed_slice). The allocator reserves space, copies the bytes, and the enum discriminant flips to Owned. The old reference is dropped. If the discriminant already points to Owned, to_mut() skips the clone and returns a mutable reference to the existing String.

This pattern gives you a zero-cost abstraction for the read path and a predictable allocation for the write path. The compiler inlines the discriminant check in release builds. Branch prediction usually handles it efficiently because the read path dominates in most workloads. You get the flexibility of an enum without the overhead of virtual dispatch or heap allocation on the hot path.

Realistic usage: processing text without forcing allocations

Real code rarely mutates strings in a vacuum. You usually have a pipeline that reads configuration, parses logs, or normalizes user input. Cow shines when you want to write a function that accepts any string-like input but only pays for allocation when necessary.

use std::borrow::Cow;

/// Normalizes whitespace in a string. Returns borrowed data if no changes are needed.
/// Returns owned data only when actual modifications occur.
fn normalize_whitespace(input: &str) -> Cow<str> {
    // Check if the string already meets our criteria.
    // Avoids allocation entirely when the input is clean.
    if input.chars().all(|c| c.is_alphanumeric() || c == ' ') {
        return Cow::Borrowed(input);
    }

    // We need to modify the string.
    // Collect into a String to trigger the owned allocation.
    let cleaned: String = input
        .chars()
        .filter(|c| c.is_alphanumeric() || c == ' ')
        .collect();

    // Wrap the result in Cow::Owned so the return type stays consistent.
    Cow::Owned(cleaned)
}

fn main() {
    // Pass a literal. The function returns Borrowed.
    let clean = normalize_whitespace("rustlang");
    println!("Clean input: {}", clean);

    // Pass a string with punctuation. The function returns Owned.
    let dirty = normalize_whitespace("hello, world!!!");
    println!("Dirty input: {}", dirty);
}

This pattern is common in parsers, serializers, and text processors. The caller never needs to decide upfront whether to allocate. They pass a &str. The function decides whether to clone. The return type Cow<str> tells the caller: you can read this immediately, and if you need to mutate it later, just call to_mut(). The allocation boundary stays inside the function that actually needs it.

Pitfalls and compiler friction

Cow is straightforward, but it trips up developers who expect automatic cloning on every mutation. Cow does not implement DerefMut for the borrowed variant. If you try to mutate through a Cow without calling to_mut() first, the compiler rejects you with E0596 (cannot borrow as mutable, as it is not declared mutable). The enum discriminant is immutable by default, and the compiler refuses to let you accidentally mutate a borrowed slice.

Another common friction point is trait bounds. Cow<T> requires T: Borrow<T> + ToOwned. If you try to use Cow with a type that does not implement ToOwned, the code will not compile. The compiler will emit E0277 (trait bound not satisfied) and point directly at the missing trait. This is intentional. Cow needs a guaranteed way to convert borrowed data into owned data when the write path triggers.

You will also encounter lifetime mismatches when returning Cow from functions that capture references. If your function returns Cow<'a, str> but the input reference only lives for a shorter scope, the compiler will force you to shorten the lifetime or return Cow<'static, str>. Treat lifetimes as contracts. If you cannot guarantee the borrowed data outlives the Cow, switch to Owned upfront or restructure the scope.

Convention aside: the community prefers calling Cow::Borrowed(data) and Cow::Owned(data) explicitly rather than relying on From conversions. It makes the intent obvious in code reviews. When you see Cow::Borrowed, you know no allocation happened. When you see Cow::Owned, you know the heap was touched. Clarity beats cleverness here.

When to reach for Cow

Use Cow<T> when you are writing an API that accepts borrowed data but occasionally needs to mutate it, and you want to avoid forcing the caller to allocate upfront. Use Cow<T> when you are building a parser or transformer that returns the original data unchanged in the common case, but needs to return a modified copy in the error or edge case. Use Cow<T> when you want to unify &T and T behind a single type without boxing or adding generic parameters.

Reach for plain &T when you only ever read the data and never need to mutate it. The borrow checker handles this efficiently, and Cow adds unnecessary enum overhead. Reach for owned T (like String or Vec<T>) when mutation is guaranteed or happens on every call. The allocation cost is already paid, so the Cow indirection only hurts performance. Reach for Rc<T> or Arc<T> when multiple owners need to share the same data and you need reference counting instead of clone-on-write semantics.

Counter-intuitive but true: Cow is not a performance silver bullet. It trades a small amount of compile-time complexity for runtime allocation flexibility. Measure your hot paths. If you are mutating on 90 percent of calls, drop Cow and use owned types. If you are mutating on 10 percent or less, Cow pays for itself.

Where to go next

Cow stands for Clone-on-Write and lets you hold data that might be borrowed or owned without knowing which one it is until you try to change it. It saves memory by avoiding a copy if you only read the data, but automatically makes a copy if you need to edit it. Think of it like a library book: you can read it without owning it, but if you want to write notes in it, you must buy your own copy first.