Create README.md by Mikerah · Pull Request #1 · ChainSafe/lodestar

Mikerah · 2018-06-22T20:09:13Z

Added an overview of the project.
Will be adding more details as it gets built out.

Merge from Chainsafe master

Temp/validator remote

Ran into this myself while setting `lodestar` up - found some old issues([ChainSafe#1](ChainSafe#3037), [ChainSafe#2](ChainSafe#1396)) dating back to 2020 indicating that others also ran into the same issue in the past, so this note is probably worth adding to `CONTRIBUTING.md`. Added a short note about what the error is about and what the solution is.

…sponse The responseEncodeError() function was yielding the error status byte and snappy-encoded error message as separate chunks through the async generator. When piped through libp2p stream.sink, this created a race condition where the stream could close after the status byte was flushed but before the error message bytes arrived on the reader side. This resulted in the requesting side receiving the correct error status code but an empty errorMessage, causing flaky failures in the e2e reqresp tests ('should handle a server error' and 'should handle a server error after emitting two blocks'). These two tests were the #1 cause of CI E2E failures, appearing in ~90% of all E2E test failures. The fix collects the status byte and encoded error message into a single Buffer.concat() yield, ensuring they are delivered atomically through the stream.

…sponse (#8908) ## Motivation The e2e reqresp tests "should handle a server error" and "should handle a server error after emitting two blocks" have been consistently flaky, appearing in **~90% of all CI E2E test failures**. Analysis of ~100 recent CI runs confirmed this as the #1 source of E2E flakiness. The failure pattern: ``` expected { code: "REQUEST_ERROR_SERVER_ERROR", errorMessage: "" } to deeply equal { code: "REQUEST_ERROR_SERVER_ERROR", errorMessage: "TEST_EXAMPLE_ERROR_1234" } ``` The error status code is received correctly, but the error message is empty. ## Root Cause `responseEncodeError()` yields the error status byte and snappy-encoded error message as **separate chunks** through the async generator: ```ts yield Buffer.from([status]); // chunk 1 yield* encodeErrorMessage(errorMessage, protocol.encoding); // chunk 2+ ``` When piped through `stream.sink`, libp2p can close/flush the stream after the first yield completes but before the subsequent error message chunks are delivered to the reader side. The `readErrorMessage()` function on the receiving end then finds no data after the status byte and returns an empty string. ## Fix Collect the status byte and encoded error message into a single `Buffer.concat()` yield, ensuring they are delivered atomically through the stream. This eliminates the race condition without changing the wire format. ## Notes - All existing reqresp unit tests pass (85/85) - The wire format is unchanged — the same bytes are sent, just in a single chunk instead of multiple - This is consistent with how other protocols handle similar issues (combining header + payload) > This PR was authored by an AI contributor. All code was reviewed by sub-agents before submission. --------- Co-authored-by: lodekeeper <lodekeeper@users.noreply.github.com> Co-authored-by: gemini-code-assist[bot] <175061342+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Nico Flaig <nflaig@protonmail.com>

…8955) ## Motivation PR #8890 (libp2p v3) suffers from ~22% Unknown peer rate (vs ~4% on v2/unstable). Root cause: libp2p v3 enforces per-protocol stream limits (`identify: maxOutboundStreams=1`). When repeated STATUS messages trigger overlapping `identify()` calls for the same peer, v3 throws `TooManyOutboundProtocolStreamsError` which cascades into massive EOF failures (~5000 identify errors/2h on feat1 vs ~287 on unstable). ## Description Minimal fix — no retries, no backoff, no spray: - **Single in-flight identify per peer**: `identifyInProgress` map keyed by `PeerIdStr → connection.id`. Before calling `identify()`, checks if there's already one in-flight for the same connection. If so, skips. - **Event-driven fallback**: Listens to `peer:identify` events from libp2p (fired on successful identify or identify-push). Updates `agentVersion/agentClient` even if our explicit `identify()` failed earlier. - **Reconnect race safety**: Uses `connection.id` as epoch token. After `await identify()`, verifies the in-flight key still matches before writing results — a reconnect during the await clears the old entry, so stale results are discarded. - **Cleanup on disconnect**: Removes in-flight tracking when peer disconnects. ## Changes - `packages/beacon-node/src/network/peers/peerManager.ts`: Added `identifyInProgress` map, `onPeerIdentify` event handler, dedup guard in `onStatus`, stale-result guard in `identifyPeer()`, cleanup in disconnect handler - `packages/beacon-node/test/e2e/network/peers/peerManager.test.ts`: 4 new tests (dedup, reconnect race, event-driven fallback, existing flow preserved) ## Evidence Loki log comparison (2h window): - **feat1 (v3)**: ~5000 identify errors — 3794 EOF, 863 EOF-while-reading, 283 missing public key, 26 too-many-outbound-streams - **unstable (v2)**: ~287 identify errors — 143 unexpected-end, 82 timeouts ``` # The overlapping call pattern (before fix): STATUS #1 → identify() in-flight STATUS #2 → identify() overlaps → TooManyOutboundProtocolStreamsError → EOF cascade # After fix: STATUS #1 → identify() in-flight, tracked in identifyInProgress STATUS #2 → sees in-flight marker, skips peer:identify event → updates agentVersion as safety net ``` > Note: This is a replacement for the retry-based approach in #8954. That PR added retry machinery which masked the root cause rather than preventing overlapping calls. ## AI Disclosure This PR was authored with AI assistance (Claude Opus 4.6 via OpenClaw). All code was reviewed and tested by the AI agent. Co-authored-by: lodekeeper <lodekeeper@users.noreply.github.com>

Create README.md

81d86ff

ChainSafeSystems merged commit f59f4f3 into ChainSafe:master Jun 23, 2018

wemeetagain pushed a commit that referenced this pull request Sep 3, 2019

Merge pull request #1 from ChainSafe/master

a06f85e

Merge from Chainsafe master

wemeetagain mentioned this pull request Nov 14, 2019

Light Client Task Force #2 - Multiproof Formats & Proof Negotiation #555

Closed

twoeths mentioned this pull request Dec 8, 2020

enable signature verification in initial sync #1217

Closed

dapplion pushed a commit that referenced this pull request Jan 19, 2022

Merge pull request #1 from novon-labs/temp/validator-remote

85ceae0

Temp/validator remote

twoeths mentioned this pull request May 28, 2025

Lodestar post-Pectra: Packing issues with newly seen attestations #7883

Open

twoeths mentioned this pull request Jul 4, 2025

PeerDAS: rate limit when syncing #8033

Closed

lodekeeper mentioned this pull request Jan 31, 2026

feat: support connecting multiple external signers to validator client #8822

Merged

lodekeeper-z mentioned this pull request Feb 3, 2026

PeerDAS Metrics Revamp: Fix phantom metrics, add missing dashboard panels, improve observability #8850

Open

13 tasks

lodekeeper mentioned this pull request Feb 14, 2026

fix: combine error status and message into single chunk in reqresp response #8908

Merged

GrapeBaBa mentioned this pull request May 1, 2026

feat: integrate fork-choice compliance spec test suite #9314

Draft

4 tasks

lodekeeper mentioned this pull request May 30, 2026

feat: fast confirmation rule #8837

Merged

nflaig mentioned this pull request Jun 9, 2026

fix: bound SeenPayloadEnvelopeInput cache size to prevent OOM #9489

Open

1 task

This was referenced Jun 11, 2026

feat: backfill sync v2 #9238

Open

SeenPayloadEnvelopeInput cache missing pruning mechanisms #9073

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Create README.md#1

Create README.md#1
ChainSafeSystems merged 1 commit into
ChainSafe:masterfrom
Mikerah:patch-1

Mikerah commented Jun 22, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

Mikerah commented Jun 22, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants