I’m looking for a data archive of corporate ownership networks. For example, Alphabet owns Google, … along with some metadata, such as when each company was created or acquired by Alphabet, if possible. I was made aware of OpenCorporates, but as far as I can tell it doesn’t have this kind of data.
Apologies in advance if this isn’t appropriate content for the community. I figured digital archivists might be aware of such an archive. I couldn’t find a Lemmy community dedicated to data suggestions. If there’s a community better suited for this post, please let me know.
I would also be really interested in a dataset like this. Institutional investors in public companies are sometimes listed in the SEC Form 10-K, but I have yet to fully learn the structure of these documents. Someone intelligent could probably write a web crawler to scrape institutional and individual ownership of companies from SEC filings. Ideally, with a proper dataset like this you could map out who owns each company, who owns each of the owning companies, and so on, which would be really interesting information.
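To make the "who owns the owners" idea concrete, here is a minimal sketch of walking an ownership graph upward from one company. It assumes the scraped data has already been reduced to (owner, owned, percent) records; the company names, percentages, and function names below are all made up for illustration and do not come from any real filing.

```python
from collections import defaultdict, deque

# Hypothetical records: (owner, owned company, percent held). All values invented.
HOLDINGS = [
    ("HoldCo A", "OpCo B", 100.0),
    ("Fund X", "HoldCo A", 12.5),
    ("Fund Y", "HoldCo A", 8.0),
    ("Person Z", "Fund X", 60.0),
]


def build_owner_index(holdings):
    """Map each owned company to a list of (owner, percent) pairs."""
    owners = defaultdict(list)
    for owner, owned, percent in holdings:
        owners[owned].append((owner, percent))
    return owners


def all_owners(company, owners):
    """Return direct and indirect owners of `company` as (owner, depth) pairs.

    Breadth-first walk; the `seen` set stops it from looping forever
    if two companies hold stakes in each other.
    """
    seen = {company}
    queue = deque([(company, 0)])
    result = []
    while queue:
        current, depth = queue.popleft()
        for owner, _percent in owners.get(current, []):
            if owner not in seen:
                seen.add(owner)
                result.append((owner, depth + 1))
                queue.append((owner, depth + 1))
    return result


if __name__ == "__main__":
    index = build_owner_index(HOLDINGS)
    for owner, depth in all_owners("OpCo B", index):
        print("  " * depth + owner)
```

The hard part is still getting reliable records out of the filings; once they exist, the graph walk itself is short.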
I don’t currently have an adequate understanding of the different SEC forms, and although I can use Python, I have no experience writing web crawlers at the moment.
I don’t have any experience with SEC forms either. Is there a repository for SEC forms? Or do you imagine going through every company’s website to mine for those forms?
The SEC has the EDGAR database, where you can look up any company and access its various SEC forms, but you still need to know which forms contain the information. For example, the 10-K of one company had ownership information on its top shareholders, but I wasn’t able to find that info in the 10-K of another company (possibly because I didn’t know where to look). I know you can use the EDGAR database to at least look up these forms, but I don’t know its full capabilities (such as whether you can query for ownership directly) because I only discovered it the other day.
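For anyone who wants to try the lookup programmatically, here is a rough sketch. It assumes EDGAR's JSON endpoint at data.sec.gov/submissions/, which returns a company's recent filings as parallel arrays under `filings.recent`, keyed by the company's CIK number. The function names and the set of form types are my own assumptions (for instance, `DEF 14A` proxy statements and `13F-HR` reports are common places to find ownership details, but check which forms actually carry the data you need).

```python
import json
import urllib.request

# Assumed list of form types worth checking for ownership data; adjust as needed.
OWNERSHIP_FORMS = {"10-K", "DEF 14A", "13F-HR"}


def submissions_url(cik):
    """Build the submissions URL; the CIK is zero-padded to 10 digits."""
    return f"https://data.sec.gov/submissions/CIK{int(cik):010d}.json"


def filter_filings(recent, wanted_forms):
    """Pick filings of the wanted form types out of the parallel arrays."""
    rows = zip(recent["form"], recent["filingDate"], recent["accessionNumber"])
    return [
        {"form": form, "filed": filed, "accession": accession}
        for form, filed, accession in rows
        if form in wanted_forms
    ]


def fetch_recent_filings(cik, user_agent):
    """Download a company's recent filing index (needs network access).

    The SEC asks automated clients to identify themselves, so pass a
    User-Agent string containing your own name and email address.
    """
    request = urllib.request.Request(
        submissions_url(cik), headers={"User-Agent": user_agent}
    )
    with urllib.request.urlopen(request) as response:
        data = json.load(response)
    return data["filings"]["recent"]
```

Usage would be something like `filter_filings(fetch_recent_filings(cik, "Your Name you@example.com"), OWNERSHIP_FORMS)`, where `cik` is the company's CIK number as shown on its EDGAR page. This only tells you which filings exist; pulling the ownership tables out of each document is a separate parsing job.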