Info Asymmetry (a.k.a the head start problem) with ePBS

From Pre-Execution Privacy in PBS:

For example, the proposed design gives the winning builder a headstart in knowing what the next state of the chain will be and an incentive to delay revelation of that state as long as possible. This constitutes a research direction in itself.

I’ve been thinking about this exact issue in the back of my mind for the past several months. It is not only a problem with Ethereum’s PTC proposal, but also with ePBS designs for layer 2s and other layer 1s that use different kinds of consensus/sequencing protocols (such as optimistically responsive or DAG protocols). I don’t have definitive answers, but I’ll share my informal thoughts on this topic. I’d love to start a discussion about this.

Goal: Determine if the information asymmetry problem is actually a problem.

Key questions to answer:

  1. Does the information asymmetry between the incumbent builder and other builders give the incumbent builder a meaningful advantage over other builders in the market?
    • If so, how does this advantage affect the properties ePBS/Ethereum is trying to achieve?
  2. If so, how can we design ePBS protocols to mitigate this advantage (either through incentives or protocol design)?
  3. How can we generalize this analysis to more broadly analyze resource asymmetry among builders?

Part I: Does the information asymmetry between the incumbent builder and other builders give the incumbent builder a meaningful advantage over other builders in the market?

First, let’s offer one possible, informal definition of the problem.

Let i represent the index of an arbitrary builder in the system. Let j represent a slot index in Ethereum. Let B_{j_i} represent a block built by builder i for slot j. Let P_{j_i} represent the profit builder i would earn on a block B_{j_i} if it is committed to the network. Let R_{j_i} represent the revenue a builder would earn on said block. Let b_{j_i} represent the bid amount from builder i to proposer of slot j.

P_{B_{j_i}} = R_{B_{j_i}} - b_{B_{j_i}}

Let s_i(\Delta) represent the strategy function of builder i, where \Delta is the duration over which the strategy executes (i.e. the time a builder has to build a block between when they receive the latest Ethereum state from the previous slot to when they must commit to providing a particular block for the current slot). Let \Delta` > \Delta.

R'_{B_{j_i}} = s_i(\Delta)

Let R_{B_{j_i}}' represent the additional profit builder i can earn in slot j with s_i(\Delta’) in comparison to the maximum revenue any builder (including itself) would earn with s_k(\Delta).

R'_{B_{j_i}} = s_i(\Delta’) - \max{\{s_k(\Delta)}\} \text{ : }\forall k \in \{0, n\}

To simplify the problem definition let’s assume the following properties:

  • Proposers always choose the highest bid available
  • Builders use the same strategy for every slot
  • The network is synchronous; builders and proposers have synchronized clocks. We don’t need to worry about builders’ bids / blocks being delayed by the network
  • All builders aim to generate the same profit (i.e. P_{B_{l_k}} = c: \forall k \in \{0,n \}
  • \forall s_k(\Delta) \text{ where } k \in \{0,n\}: \Delta \rightarrow \infty, s_k(\Delta) \rightarrow \infty; all strategies are rising functions over time \Delta

It should follow: if a builder is able to generate more revenue for some block B, they can bid higher for said block to the proposer.

Next, let’s define what a meaningful advantage might be.

One potential definition of meaningful:

R_{B_{j_i}} is meaningful if it causes builder i to win the next slot’s bid auction more often than it otherwise would without the advantage. Thus meaningful advantage for a single slot (under our simplified model) is when:

\theta(R_{B_{j_i}}) > x

where x is some threshold considered tolerable by the network and \theta is the bid-win-rate of the builder. In the most conservative case x = 0. In reality the network may be comfortable with some small threshold, such as 5% (as averaged over many slots).

An observer might recognize that according to the above definition some profit advantage may exist in Ethereum (and similar systems) already. For example, if the proposer of slot j-1 in current-state Ethereum acts as their own builder, they will have access to slot j - 1 ’s state before the rest of the network. This could give them some advantage in bidding to the proposer of slot j.

It is likely, though, that there exist thresholds of advantage such that only certain values of \Delta` give builders meaningful advantage. It is also possible some builders will be less affected by the advantage of another builder, depending on that builder’s strategy. For example, a builder whose strategy relies heavily on private mempool transactions and little on the latest Ethereum state may be minimally affected by the advantage of the incumbent builder (i.e. their bid-win-rate stays the same regardless of the advantage).

Does this problem affect properties of ePBS or Ethereum?

Some potential ways the information asymmetry problem may play out:

  • If the advantage of an incumbent builder is commonly large, the network may centralize on a single builder (or have long bouts where there is a single builder for many slots in a row). In the absence of partial block auctions, this may give the centralized builder monopolistic pricing power during their reign.
    • The market may still converge on a single builder if such a builder can consistently outbid others. However, the key difference is that with ePBS the protocol itself is giving a builder this advantage; therefore no longer supporting a fair, competitive market
  • If the advantage is at least moderate, we may see more effort to extract multi-block MEV opportunities.
  • In optimistically responsive protocols (such as those in use by layer 2s or other layer 1s), the builder has an incentive to delay releasing the block data, which forces a necessary delay in the protocol that optimistic responsiveness tries to avoid.

Finally, how do we find the answer?

Some potential pathways:

  1. Analyze existing mev-boost data from the Flashbots relay.
    • Can we find signals to meaningful advantage in the case where builders also run proposers?
    • Can we analyze block contents to determine the kind of strategies builders use? Are these strategies heavily reliant on the previous slot’s state? Can we approximate a strategy curve for a few prominent builders?
    • How much do builder’s bids change over a slot? Do they remain consistent, or can builders offer significantly higher bids given more time to execute their strategy
    • Can we approximate the profit margin of different builders?
    • Can we analyze current consecutive builder behavior to determine if this advantage meaningfully exists in Ethereum today?
  2. Pursue a game-theoretic analysis
    • Investigate existing economic literature about market competition with information asymmetry (or resource asymmetry in general).
      • Can we apply existing economic models to better define/analyze this problem? Do these models capture complexities the simplistic model above does not? Where do the assumptions of existing economic models’ problem space and our problem space differ?
    • How do we anticipate builder strategies and resources changing over time, both with and without ePBS? Will builders converge on similar strategy curves in the long run?
    • Does this make current multi-block MEV strategies more viable?
    • How does this interplay with mev-burn?

Part II: If so, how can we design ePBS protocols to mitigate this advantage (either through incentives or protocol design)?

The answer to this second question relies on how the meaningful advantage is gained, and under what parameters. I have little to offer in this section, but here are a few preliminary ideas:

Let’s quickly look at a strawman solution of using slot auctions instead of block auctions. In a slot auction a builder does not have to commit to the contents of the block until it actually reveals the block data. Builders can submit bids to proposers and hope that by the time the actual block is released, they’ve been able to extract higher block revenue. If a non-incumbent builder is willing to take on additional risk, they may choose to bid competitively with the incumbent builder with the hope that by the time the block data is revealed, they’ve been able to actually extract enough revenue to match the bid they submitted to the proposer. Slot auctions could be an easy way to force the protocol to give other builders a minimum amount of block building time with the latest state. Perhaps the slot auction time can be tuned such that this minimal building time is enough to mitigate incumbent builder advantages. However, balancing the slot auction time with protocol latency will be challenging, particularly since the ideal slot auction time for mitigating the head start problem will change as builder strategies change. The slot auction also requires non-incumbent builders to take on more risk than the incumbent builder.

A second strawman solution could be to disallow the same builder from being selected in conseuctive slots. This would mitigate the information asymmetry problem almost completely. However this doesn’t seem like a good strategy since the same builder entity could easily change their on-chain identity each slot to circumvent this rule. It’s also not clear that consecutive builders with the same identity are always “bad.” (E.g. perhaps SUAVE becomes the incumbent builder; that’s considered a desirable outcome by many.)

Neither strawman, in my opinion, gives a proper solution.

Notes

  1. The above model is overly simplistic. In reality builders may not use the same strategy for each block, builders do not have equal profit margins among them, and builders may start building blocks before knowing the latest Ethereum state.
  2. The above model is not at all rigorous. It’s purely meant as a preliminary mental model. We should more explicitly define \theta, “advantage,” etc.
2 Likes

Thank you for the article.

Is this a typo? Should it instead be R'_{B_ji} ?

1 Like

Yes, you are correct! I’ve updated the post. Thank you!

2 Likes

Thanks for writing this up! Very nice :slight_smile:

At a high level I think there are some things we should try separate out. In one way of looking at it, the info asymmetry could mean that winning block n means that block n+1 is more likely. This might be the case because you are able to hedge more cheaply for the next block’s trades since no one else is trying to do that yet (dubious whether this plays a big role given the price might not have moved that much yet). Another example is for chains with fast block times where a little bit of a headstart could mean that you calculate arbs over routes longer than any of your competitors.
Another way of looking at things is that the info asymmetry value will be bid up in the block itself such that anyone who has this additional edge will have paid for it in the previous block, hence they don’t really have edge at all.
To me, understanding which of these two ends up being the case is the abstracted question we want to look at.

Some smaller points:

One thing that’s missing with this framing, I think is that the advantage is the important part here. So the value of the strategy could also depends on how much time competitors have. This could be, for example, because having a headstart means hedging for a certain opportunity is cheaper. Maybe we don’t want to model that, but just raising it.

Why this property?

So you’re thinking of theta as giving the win rate for slot n+1 given a certain revenue at block n? We would only really care that the revenue is non-zero as this indicates winning the auction at n right?

Perhaps the slot auction time can be tuned such that this minimal building time is enough to mitigate incumbent builder advantages.

I don’t see why this is different in slot vs block auctions?

Relevant links:

1 Like

Thank you so much the response! My apologies for the late reply! I’m still reading through the links you posted as well.

TL;DR

The actual question to ask is whether or not builders have incentives to delay releasing data and if this delay impacts the goals of the protocol. To me it seems that there is always incentives for builders to delay releasing the data. I need to think more about how this impacts the goals of a protocol.

Actual Response

Another way of looking at things is that the info asymmetry value will be bid up in the block itself such that anyone who has this additional edge will have paid for it in the previous block, hence they don’t really have edge at all.

This is very interesting! I hadn’t thought of this initially, but it makes total sense to me.

To me, understanding which of these two ends up being the case is the abstracted question we want to look at.

How can we answer this question? My initial thoughts are that this seems to be dependent on the sophistication of the builder. A sophisticated builder could take the time to properly model the info asymmetry advantage they get and price that into their bids, whereas a less sophisticated builder would not.

However, this leads me to ask why we have to differentiate between the two modes at all? After re-reading my own post and your original post, I’m wondering if I am asking the right questions. It seems like the question we actually want to answer is not really whether the info asymmetry problem exists, but whether or not builders are incentivized to delay releasing their block data and what effect that delay has on the goals of the protocol.

Are builders incentivized to delay releasing data?

In a protocol with a fixed slot time, such at Ethereum, it seems to me that so long as a builder releases the data in time to for the network to vote on it, there is no reason not to delay the release of data. Releasing the data faster does not earn them any extra profit over time. Even if the info asymmetry advantage is completely minimal, builders lose nothing by releasing the data late.

I think this same idea will hold most of the time for an optimistically responsive protocol. However, I could see an edge case where a builder is incentivized to release the block data as early as possible so long as they are still very confident they will win the next block. If they are sure they will win the next block and many blocks after that, then releasing the block data earlier means that the network commits blocks more frequently. This means that so long as they are the winning builder they will make more money over time since blocks are being committed to the network more quickly. However, I don’t think this will really be the case in practice.

As you mention in another section of your response, a builder’s strategy is dependent on the time they have to build blocks. It is possible that delaying the distribution of block data could also serve to give them more time to build the next block. For example, longer block times could mean better arbitrage opportunities with off-chain parties. So builders for an optimistically responsive protocol might delay releasing data just to force longer block times.

In conclusion, I don’t see any incentive a builder has to release the data as fast as they can.

How does this affect the goals of the protocol?

For an optimistically responsive protocol, this behavior is at odds with the protocol moving as fast as the network will allow.

I need to think more other effects, particularly for fixed-slot protocols.

Other thoughts:

One thing that’s missing with this framing, I think is that the advantage is the important part here

I completely agree! Overall, there are several parameters that should be inputs into the strategy functions.

Why this property?

This was just to simplify the model. If a builder is willing to take less profit on a block than other builders then they can bid higher to the proposer despite any headstart another builder may have. A more realistic model should include that some builders have lower profit margins than others.

So you’re thinking of theta as giving the win rate for slot n +1 given a certain revenue at block n? We would only really care that the revenue is non-zero as this indicates winning the auction at n right?

At the time of writing my original post, I was thinking of this as potential revenue rather than realized revenue. I was thinking that theta represented the chance a builder had to win the auction given a particular bid.

I don’t see why this is different in slot vs block auctions?

My initial thinking was that a slot auction gives a builder a bit more time to build their block once the previous block’s data has been released. But after thinking about it more, I agree that there isn’t a fundamental difference between the two. There isn’t anything that determines when over the course of the slot auction the previous data is available.