How is ArrayDeque implemented (circular buffer)?

The Queue throw-vs-return method matrix, why ArrayDeque replaces Stack, and the PriorityQueue iterator trap.

Cracked Java

ArrayDeque is a resizable circular buffer with two index pointers (head and tail) and a size that is always a power of two. Operations on either end use bitmask arithmetic (index & (capacity - 1)) instead of modulo, making addFirst, addLast, pollFirst, and pollLast all O(1) amortized.

The data structure

public class ArrayDeque<E> {
    transient Object[] elements;  // length is power of two
    transient int head;           // index of first element (or 0 if empty)
    transient int tail;           // index where the next element would go
    // size() == (tail - head) & (elements.length - 1)
}

head and tail move independently; when one reaches the end of the array, it wraps to index 0. The buffer is full when head == tail after an insert, which triggers a resize (double capacity).

capacity = 8 (power of two)
elements: [ E, F, _, _, _, A, B, C, D ]
              ^         ^
            tail      head

addLast(G): writes G at index 2, tail moves to 3
elements: [ E, F, G, _, _, A, B, C, D ]
                 ^         ^
               tail      head

pollFirst(): reads A at index 5, head moves to 6
returns A, deque is now: B, C, D, E, F, G

ArrayDeque circular buffer layout

Why power-of-two capacity

To wrap an index from N back to 0 you'd normally write i % capacity. But % is slow on integer dividers. If capacity is a power of two, i % capacity == i & (capacity - 1) — a single bitwise AND. ArrayDeque hardcodes powers of two so every index computation is a bitmask.

private static int inc(int i, int modulus) {
    if (++i >= modulus) i = 0;
    return i;
}
// fast-path equivalent for power-of-two modulus:
// i = (i + 1) & (modulus - 1);

Resize policy

When the buffer is full, ArrayDeque allocates a new array of double capacity and copies elements so that head is at index 0 and tail is at index size. This is O(n) but amortizes to O(1) per insertion.

The initial default capacity is 16. You can pre-size with new ArrayDeque<>(expectedSize) — the constructor rounds up to the next power of two.

Why not LinkedList?

LinkedList also implements Deque. Both offer O(1) addFirst/addLast/removeFirst/removeLast. But:

LinkedList allocates a Node object per element (24+ bytes of overhead each, plus GC pressure).
LinkedList nodes are scattered across the heap — terrible cache locality.
ArrayDeque stores elements contiguously in an Object[] — cache-friendly.
ArrayDeque requires power-of-two capacity (small memory overhead) but no per-element overhead.

Microbenchmarks consistently show ArrayDeque is 2-4x faster than LinkedList for both queue and stack workloads. The JDK Javadoc explicitly recommends ArrayDeque over LinkedList for stack and queue use.

Limitations

Not thread-safe — wrap with Collections.synchronizedDeque or use ConcurrentLinkedDeque/LinkedBlockingDeque.
Doesn't allow null — null elements would conflict with the "empty slot" sentinel; rejected with NPE.
Iterator is fail-fast — concurrent modification during iteration throws ConcurrentModificationException.

Operation	Best	Average	Worst	Note
addFirst / addLast / offerFirst / offerLast	O(1)	O(1)	O(n) on resize	Amortized O(1)
pollFirst / pollLast / peekFirst / peekLast	O(1)	O(1)	O(1)	Index arithmetic
contains / remove(Object)	O(n)	O(n)	O(n)	Linear scan
size	O(1)	O(1)	O(1)	Computed from head/tail