Data Cleanup
Fund Data Foundation
Clean, structured fund data your AI workflows can actually use.
The Fund Data Foundation is a fixed-price engagement that extracts, cleans, and structures your fund data into the FundCore Data Model. Whether your data lives inside an incumbent admin (SS&C, Citco, Apex, Gen II, BNY Mellon, State Street, Northern Trust, Alter Domus), in a tangle of spreadsheets, or in PDF capital accounts, we map it into a queryable foundation you own and AI agents can actually reason across.
Book Discovery Call to Scope YoursWhere AI for private funds keeps failing
PCAPs that don't reconcile. Capital accounts in conflict across systems. Partnership terms buried in unindexed PDFs. The data underneath every "AI project for funds" is what makes those projects fail. Fix that, and the rest becomes possible.
What's included in a Foundation engagement
- →Data extraction — from your current systems (incumbent admin via API or scheduled flat-file exports, plus spreadsheets, PDFs, partnership docs)
- →Data profiling — we identify what's clean, what's dirty, what's missing, and what's contradictory
- →Cleansing & normalization — to the data model's standards
- →Mapping to data model entities — funds, investors, commitments, capital events, GL accounts, portfolio companies, valuations, partnership terms, side letters
- →Validation — every transformation traceable back to source
- →Delivery — your data, in your control, in a queryable structure you own
- →Documentation — of every cleanup decision (auditable)
Engagement scope
Foundation engagements come in two scopes. Both are fixed-price, scoped on a 30-minute discovery call. The discovery call is free; we follow up with a fixed-price proposal within 48 hours.
Foundation Core
Starting around $10K
- Single fund or simple multi-fund
- Standard PCAP, GL, capital account, PPM ingest
- Smaller LP bases
- Straightforward data sets with clean source systems
- 1–2 week turnaround
Foundation Enterprise
Typically $15–30K
- Complex historical cleanups
- Multi-fund / multi-vintage families
- Larger LP bases
- Bespoke integration mapping with incumbent admins
- 2–3 week turnaround
- Includes data extraction layer for ongoing Subscription
Keep it clean: Data Layer Subscription
A one-time cleanup is great. Ongoing freshness is what makes AI workflows actually work. The Data Layer Subscription keeps your fund's data current with daily or weekly syncs from your incumbent admin's data, handling format changes, new transactions, new investors, and ongoing edge cases without you needing to think about it.
Pricing: Monthly recurring, scoped per fund based on data volume and sync frequency. Quoted alongside the Foundation proposal.
- Scheduled syncs from your incumbent admin (daily or weekly)
- Ongoing cleanup as new data lands
- Schema evolution as your fund's needs change
- Monthly data quality report
- Direct line to us for issues
Brand-new funds skip this
If you're standing up a brand-new fund, there's no legacy data to clean. Skip the Foundation and go straight to new fund onboarding. Different work, different price, same data model underneath. Talk to us about which path makes sense.
Locked into a legacy admin? That's exactly what we solve.
Most medium and large funds are locked into incumbent admins (SS&C, Citco, Apex, Gen II, Alter Domus, BNY Mellon, State Street, Northern Trust). Switching admins is painful. The good news: you don't have to. We map their tables into the FundCore Data Model on top of what they give us (API where possible, scheduled flat-file exports where not). You keep your admin. You unlock your data. Your AI workflows finally have something real to work with.
Ready to scope a Foundation engagement?
Book Discovery Call30 minutes. Fixed-price proposal within 48 hours.