1. Definition

  • Team Draft Interleaving (TDI) is an online evaluation technique for comparing two ranking algorithms (say Algorithm A vs Algorithm B).
  • Like Balanced Interleaving, it builds a single mixed ranking list from both algorithms and shows it to users.
  • The difference is: TDI uses a draft-pick process (like picking players for a sports team) to decide which algorithm contributes the next item to the combined list.

This ensures fair representation and reduces bias from list position.


2. Why It’s Used

  • Problem with A/B testing: Splits traffic between A and B, requiring large samples.
  • Problem with Balanced Interleaving: Can sometimes favor one algorithm depending on overlap/order.
  • Solution – TDI: Randomizes the “drafting” process to ensure each algorithm gets equal opportunity.

3. How Team Draft Interleaving Works

  1. Take the top-k ranked results from Algorithm A and Algorithm B.
  2. Randomly select which algorithm gets the first pick.
  3. Alternate “draft picks”:
    • If A picked first, then B picks second, then A again, etc.
  4. Each algorithm selects its top-most unchosen item that hasn’t yet appeared in the combined list.
  5. Continue until the final interleaved ranking is filled.
  6. Show this interleaved list to the user.
  7. Attribute each user click to the algorithm that drafted that item.

4. Example

Suppose we have two algorithms:

  • Algorithm A: [A1, A2, A3, A4]
  • Algorithm B: [B1, B2, B3, B4]

Draft process (randomly chosen):

  • Round 1: A picks first → A1
  • Round 2: B picks next → B1
  • Round 3: A picks → A2
  • Round 4: B picks → B2
  • … and so on.

Final list (interleaved): [A1, B1, A2, B2, A3, B3, A4, B4]

If a user clicks B2, credit goes to Algorithm B.


5. Advantages of TDI

  • Fair & unbiased: Randomized drafting prevents systematic bias.
  • Efficient: Requires fewer users than A/B testing to detect differences.
  • Click attribution clarity: Each item is clearly “owned” by one algorithm.
  • Sensitive: Good at detecting small performance differences.

6. Limitations

  • Only compares two algorithms at a time.
  • Doesn’t naturally extend to >2 algorithms (though variations exist).
  • Still assumes clicks = relevance, which may be noisy.
  • Needs careful handling of ties and overlapping results.

7. Comparison: Balanced Interleaving vs Team Draft Interleaving

FeatureBalanced InterleavingTeam Draft Interleaving (TDI)
Mixing methodAlternate items from A and BRandomized draft pick between A & B
FairnessCan sometimes favor one algorithm if overlaps existMore balanced, avoids systematic bias
RandomizationLimitedStronger (random draft order each time)
Click attributionSometimes ambiguousAlways clear (each pick belongs to A or B)
Use caseSimple, small testsPreferred industry standard for fairness

In short:
Team Draft Interleaving (TDI) is an online evaluation method for comparing two ranking algorithms. It works by letting algorithms take turns “drafting” results into a shared ranking list. Users interact with the combined list, and clicks are credited to whichever algorithm drafted the item. TDI is considered more robust and fair than Balanced Interleaving.