Insights
How to Track 8-K Filing Frequency
Alphanume Team · June 9, 2026
Turning material-event filings into a time series that reads corporate attention, stress, and structural change more precisely than a raw filing count ever could.
Every time a material event occurs at a public company — a major acquisition closes, an executive departs, a lender declares a default, an unregistered securities sale is executed — the company is required to file a Form 8-K with the SEC, typically within four business days. The result, accumulated across thousands of issuers and decades of EDGAR history, is one of the richest event-signal archives available to quantitative researchers. But the value is not in the filing count alone. 8-K filing frequency broken down by Item code — the numbered disclosure categories embedded in every filing — tells a fundamentally different story than a simple tally. This post covers how to build that time series, what to do with it once you have it, and where the engineering traps are.
Alphanume's SEC Filing Intensity dataset operationalizes this signal at scale, providing per-issuer filing counts, Item-level breakdowns, and velocity metrics across rolling windows. That dataset is the practical starting point for analysts who want to incorporate 8-K frequency into a screen or model without rebuilding the data pipeline from scratch.
What the 8-K actually contains
The 8-K is the current-report form — the disclosure mechanism for events that cannot wait for the next quarterly or annual filing. Its structure is a numbered Item taxonomy, and each Item maps to a specific category of material event. Knowing the taxonomy is prerequisite to extracting anything useful from the filings.
The Items that appear most frequently in analytical contexts:
- Item 1.01 — Entry into a material definitive agreement. The catch-all for significant contracts: credit facilities, merger agreements, licensing deals, joint ventures. High frequency of 1.01 filings often reflects active deal-making or refinancing activity.
- Item 2.01 — Completion of acquisition or disposition. Filed when a previously announced transaction closes. The gap between a 1.01 filing and the corresponding 2.01 gives a measure of deal execution lag.
- Item 2.04 — Triggering events that accelerate or increase a direct financial obligation. This is the distress-signal Item — missed payments, covenant violations, and acceleration notices land here. A 2.04 filing is a hard flag.
- Item 3.02 — Unregistered sales of equity securities. Dilutive equity raises that bypass the S-1 or S-3 registration process. Frequent 3.02 filings at stressed companies signal cash-conservation mode.
- Item 5.02 — Departure or appointment of officers and directors. Leadership turnover. A cluster of 5.02 filings in a short window is a governance-disruption signal.
- Item 7.01 and 8.01 — Regulation FD disclosures and other events. These are the catch-all voluntary disclosure Items. Heavy 7.01/8.01 filing activity at a company that is simultaneously quiet on substance can indicate information management rather than genuine transparency.
An issuer that files six 8-Ks in a quarter, all 5.02 leadership changes, is experiencing something categorically different from an issuer that files six 8-Ks split across 1.01 material agreements and 2.01 acquisition completions. The Item mix is the signal; the raw count is only the carrier.
Building the 8-K frequency time series
The raw material is EDGAR's full-text submission index, published quarterly at /Archives/edgar/full-index/ and available as compressed pipe-delimited flat files. Each line in the form.idx file contains the CIK (issuer identifier), company name, form type, date filed, and the path to the filing document. Filtering form type to 8-K gives the universe of current reports.
The construction steps:
- Pull the index files. Download quarterly index files for the history you need. The EDGAR full-text search API (
efts.sec.gov) also supports real-time queries by form type and date range, which is preferable for production pipelines that need same-day ingestion. - Parse the Item codes from the filing body. The Items disclosed in each 8-K appear in the filing's HTML or text body in a standardized section header format —
Item 1.01,Item 5.02, and so on. A single 8-K can report multiple Items; parse all of them. Do not rely solely on the cover page checkboxes on the EDGAR index page, which are sometimes incomplete or malformatted on older filings. - Timestamp by acceptance datetime, not filing date. EDGAR records both the date the filer intended and the actual acceptance datetime — when the filing cleared EDGAR's validation queue. Use acceptance datetime for a point-in-time correct record. The date-filed field in the index can differ from acceptance by a day at month or quarter boundaries.
- Aggregate per issuer per window. Roll the stamped, Item-tagged records into whatever window suits your research horizon: trailing 30 days, trailing 90 days, or fixed calendar quarters. For each issuer-window, compute total filing count and a breakdown by Item code or Item category group.
The result is a panel of issuer-date-Item counts that can be joined to any other fundamental or market data by CIK and date. This is the foundation for filing velocity as an early-warning signal.
Normalizing for baseline and sector
Raw 8-K counts are not cross-sectionally comparable. A large-cap financial holding company with dozens of subsidiaries will file 8-Ks at a structurally higher rate than a single-product biotech, regardless of any event-driven spike. Three normalization steps make the time series usable:
Issuer baseline adjustment. For each issuer, compute the trailing mean and standard deviation of 8-K filings per quarter over the prior two to three years. Express the current-period count as a z-score relative to that baseline. A z-score above two standard deviations flags an anomalous acceleration at that specific issuer — regardless of whether the absolute count looks high or low.
Peer and sector adjustment. Some Item categories are structurally more common in certain sectors. Energy companies file more 1.01 material agreements around commodity offtake contracts; biotech companies file more 8.01 items around clinical trial results. Compute sector medians for each Item category and normalize the issuer's Item-level counts against the sector peer group. This isolates idiosyncratic spikes from industry-level patterns.
Seasonality. Calendar quarter-end and fiscal year-end periods see elevated 8-K activity across issuers, driven by board meetings, compensation decisions, and auditor engagement letters. A simple month-of-year adjustment — subtracting the cross-sectional average filing rate for that calendar month from each issuer's count — removes the seasonal baseline without requiring a complex decomposition.
After these three adjustments, what remains is the issuer-specific, sector-adjusted, seasonality-stripped filing intensity: the signal component.
Reading the Item mix to classify the event cluster
The normalized count tells you how much activity is occurring; the Item mix tells you what kind. A useful classification framework groups Items into four event-cluster types:
- Deal cluster. Concentration in 1.01 and 2.01 Items, possibly accompanied by 5.02 leadership additions as deal teams integrate. This pattern is consistent with an active M&A or business-development period.
- Stress cluster. Any 2.04 filing is an automatic escalation. Accompanied by 3.02 dilutive equity raises, 1.01 amendments to credit agreements, and 5.02 departures, this cluster is the distress profile. Understanding what a spike in filings means in this context can be the difference between early positioning and reaction.
- Governance cluster. Concentration in 5.02 (leadership changes), possibly 8.01 (other disclosures related to board matters). A governance cluster without accompanying deal or stress items may signal a strategic pivot, activist pressure, or succession planning.
- Disclosure cluster. Dominance of 7.01 and 8.01 Items — voluntary disclosures — with low or zero presence of substantive Items. A company in this pattern is communicating, but not necessarily doing. Treat disclosure clusters as attention-management signals rather than event signals.
The classification does not need to be mutually exclusive. Score each issuer-period on each cluster dimension and allow overlap. A deal cluster with emerging stress indicators is a more nuanced and often more actionable signal than either profile in isolation.
Engineering gotchas
Three recurring problems corrupt 8-K frequency time series when not handled explicitly:
8-K/A amendments. An 8-K/A is an amendment to an earlier 8-K — most commonly filed to add exhibits (financial statements, agreements) that were omitted from the initial filing within the permitted window. Including 8-K/As in a raw count double-counts the event. The correct treatment is to link each 8-K/A back to its parent 8-K by matching on CIK and the referenced filing date, then deduplicate. If you cannot link reliably, exclude 8-K/A forms from the event count and treat them as exhibits only. Keeping them in the count will inflate frequency readings for the issuers most diligent about timely exhibit filing, which is the opposite of what you want.
Exhibits-only filings. Some 8-K filings contain no substantive Item disclosures — they exist solely to attach an exhibit to the EDGAR record. These filings check no Item boxes and contribute no event information. They should be identified by the absence of any Item-code content in the filing body and dropped from the event count. Failing to do so adds noise that is especially prevalent for large issuers with active exhibit-filing programs.
The event-date versus filing-date gap. Companies have four business days from the triggering event to file the 8-K. This means the EDGAR acceptance datetime can lag the actual event by up to a week, and in practice some companies are late filers. For research questions about corporate events, the filing date is what you actually know in real time — it is the point-in-time correct observation date. The event date disclosed within the 8-K body (the date the agreement was signed, the date the officer departed) reflects when the event occurred but would introduce look-ahead bias if used as the observation timestamp in any backtest. Always anchor to acceptance datetime for signal construction; reserve the internal event date for cross-referencing or measuring filing lag as a separate variable.
Folding 8-K frequency into a screen
A frequency time series is most useful when combined with item-level classification and at least one confirming signal from another data source. A practical screen architecture:
- Filter on normalized intensity. Identify issuers in the top decile of z-scored 8-K frequency for the trailing 30 or 90 days. This is your attention universe — companies generating elevated disclosure activity relative to their own history and their sector peers.
- Classify the event cluster. Within the attention universe, separate stress clusters (any 2.04, elevated 3.02) from deal clusters (1.01 and 2.01 dominant) and governance clusters (5.02 dominant). Each cluster warrants a different analytical follow-on.
- Cross-reference with market and fundamental signals. Stress-cluster names should be cross-referenced against bond pricing, equity drawdown, and short interest. Deal-cluster names should be screened against valuation and integration-risk metrics. Governance-cluster names warrant a review of the proxy record and any 13D/13G filings around the same period.
- Apply a recency decay. Weight the most recent filings more heavily than older ones within the trailing window. An acceleration in the most recent two weeks that was absent two months ago is a more actionable signal than an elevated rate sustained at a constant level throughout the window.
The screen does not produce trade recommendations — it produces a ranked list of issuers where elevated 8-K filing frequency indicates that material events are occurring at an above-baseline rate. What those events mean depends on the Item mix, the cluster classification, and whatever additional context the analyst or model brings to bear. The frequency time series is the first filter, not the last.
At scale, this methodology transforms a passive compliance archive into a systematic event-detection layer. The EDGAR record is already there, already dated, already tagged with Item codes. Building the time series is an engineering problem; knowing how to read it is an analytical one. The two together produce a signal that raw filing counts, or most conventional event databases, cannot replicate.