Top Domains by Extracted Triples for Extractor Microformats species
Back to Statistics
This page contains the list of top domains using mf-species of the extraction of October 2023 of the Web Data Commons project.The page shows the top domains employing Microformats species within their websites, ordered by the number of triples found in the crawl corpus.
- wikipedia.org (321,284 triples)
- wikibooks.org (34,902 triples)
- antwiki.org (24,575 triples)
- coastalplainplants.org (13,691 triples)
- blogspot.com (8,872 triples)
- kiddle.co (8,109 triples)
- wikidoc.org (6,346 triples)
- wmflabs.org (3,386 triples)
- wiktionary.org (3,224 triples)
- altwiki.org (2,914 triples)
- wikimedia.org (2,893 triples)
- en-academic.com (2,853 triples)
- handwiki.org (2,630 triples)
- plantdollar.com (2,426 triples)
- amanitaresearch.com (1,499 triples)
- abcdef.wiki (1,192 triples)
- weblio.jp (1,044 triples)
- db0nus869y26v.cloudfront.net (967 triples)
- mdwiki.org (741 triples)
- atptree.com (614 triples)
- findatwiki.com (541 triples)
- wikichiro.org (451 triples)
- umpan.com.my (431 triples)
- sinoxnursery.com (429 triples)
- wikiwand.com (374 triples)
- privacytools.io (340 triples)
- marinemammalscience.org (334 triples)
- cloudflare-ipfs.com (306 triples)
- detailedpedia.com (279 triples)
- marefa.org (248 triples)
- tanijaya.com (224 triples)
- partcommunity.com (217 triples)
- bharatpedia.org (203 triples)
- 654.pl (184 triples)
- wikipredia.net (167 triples)
- iiab.me (135 triples)
- herodictionary.com (131 triples)
- evergreen.edu (131 triples)
- alquds.edu (127 triples)
- besserwiki.de (118 triples)
- kehati.or.id (117 triples)
- es-academic.com (101 triples)
- snaturou2000.sk (100 triples)
- ipfs.io (95 triples)
- treviambiente.it (90 triples)
- planetfish.org (84 triples)
- eagle-rock.org (81 triples)
- definify.com (76 triples)
- idwikipedia.org (75 triples)
- dempseynurseries.com (74 triples)
- enwikipedia.net (73 triples)
- jfoakes.com (72 triples)
- conworkshop.com (72 triples)
- bourndaeec.nsw.edu.au (70 triples)
- ikekoi.fr (66 triples)
- nv-os.org (56 triples)
- kfd.me (56 triples)
- wikizero.com (53 triples)
- chalochatu.org (51 triples)
- possumliving.com (45 triples)
- bigfootcasts.com (44 triples)
- casplantje.nl (38 triples)
- mindquota.com (38 triples)
- msu.edu (34 triples)
- copro.com.ar (34 triples)
- techsciencenews.com (34 triples)
- majutani.com (34 triples)
- meddic.jp (32 triples)
- cavallidacorsa.org (32 triples)
- karaitivu.org (32 triples)
- thewinehacker.com (30 triples)
- wikizero.org (30 triples)
- ant-photo.eu (28 triples)
- benakhati.com (28 triples)
- iwherbalzworld.com (27 triples)
- universityhealthcenter.in (26 triples)
- knowpia.com (26 triples)
- indahcraft.net (26 triples)
- farwellfruitfarm.com (25 triples)
- wikipediaforschools.org (25 triples)
- infogalactic.com (25 triples)
- seed.ir (24 triples)
- teamofmonkeys.com (23 triples)
- worldchampionshipcoyotecallingcontest.com (23 triples)
- hailcincinnati.com (23 triples)
- neuroinf.jp (23 triples)
- conworlds.org (23 triples)
- classicistranieri.com (23 triples)
- dailygaggle.com (23 triples)
- catfishconference.com (23 triples)
- squper.com (23 triples)
- fishkeepingforever.com (22 triples)
- dmfarm.it (22 triples)
- weblaboratorium.hu (22 triples)
- mrowl.com (20 triples)
- encyclopediaofastrobiology.org (20 triples)
- urbanpestis.com (19 triples)
- hyperlinked.wiki (19 triples)
- thebiofiles.com (18 triples)
- webot.org (17 triples)
- pihattcoffee.com (17 triples)
- greengoldghana.com (17 triples)
- wazji.pl (17 triples)
- naturescapesofbeaufort.com (17 triples)
- kachaf.com (17 triples)
- arabsciencepedia.org (17 triples)
- sw-em.com (17 triples)
- thcscience.wiki (17 triples)
- mate-tea.net (17 triples)
- thekitchenplayground.com (17 triples)
- wikinfo.org (17 triples)
- wiki2.org (16 triples)
- theplantlady.com (16 triples)
- jamesdickfineart.com (16 triples)
- usmantis.com (16 triples)
- plantscapedubai.com (16 triples)
- aaichisavali.com (16 triples)
- animalstime.com (16 triples)
- wikizero.net (15 triples)
- wanweibaike.net (15 triples)
- niaoleiba.com (15 triples)
- birdmanspetsource.com (15 triples)
- stekom.ac.id (14 triples)
- seashellshop.com (14 triples)
- infoanew.com (13 triples)
- faunadanflora.com (13 triples)
- fijibutterflyfishcount.com (13 triples)
- sbiras.cz (13 triples)
- omniversalis.org (13 triples)
- vollmedica.eu (13 triples)
- digiprotein.ir (12 triples)
- koumtsidis.gr (12 triples)
- slipfox.xyz (11 triples)
- qiuwenbaike.cn (11 triples)
- ecofood.hk (10 triples)
- profilbaru.com (10 triples)
- originalpeople.org (10 triples)
- fotoartbook.com (10 triples)
- wildroots.in (10 triples)
- silichip.org (10 triples)
- biota.pt (10 triples)
- uncyclopedia.co (9 triples)
- atozwiki.com (9 triples)
- cannaqa.wiki (8 triples)
- profillengkap.com (8 triples)
- wikizand.com (8 triples)
- gabitos.com (8 triples)
- satriahewan.com (8 triples)
- wiki.edu.vn (7 triples)
- profilpelajar.com (7 triples)
- wikipedia-on-ipfs.org (6 triples)
- eymaps.com (6 triples)
- bildiris.com (6 triples)
- histo.cat (6 triples)
- yoda.wiki (6 triples)
- westernrivers.org (6 triples)
- wikigerman.edu.vn (5 triples)
- tazintosh.com (5 triples)
- revivalmushroom.com (5 triples)
- themagictruffleshop.com (5 triples)
- happyrockpets.com (5 triples)
- drtharangawickramasooriya.com (5 triples)
- hiagro.com (5 triples)
- prfrp.org (5 triples)
- galaxy-vn.com (5 triples)
- wikipedia.su (5 triples)
- tldrify.com (5 triples)
- mafiacorruption.pl (5 triples)
- tojsiab.com (5 triples)
- ralfschepp.de (5 triples)
- andishehstars.com (5 triples)
- uncyclopedia.com (5 triples)
- montagneaperte.it (5 triples)
- wikirank.net (5 triples)
- laketoba.net (5 triples)
- rc-org.com (4 triples)
- organicfooda.com (4 triples)
- sunitjotravel.com (4 triples)
- hanauma.org (4 triples)
- selfstudyanthro.com (4 triples)
- brihaat.com (4 triples)
- bingj.com (4 triples)
- aimasworld.in (4 triples)
- pooyesh-dar-kardarmani-karaj.ir (4 triples)
- qudswiki.org (4 triples)
- whyevolutionistrue.com (4 triples)
- educationgo.co.in (4 triples)
- limswiki.org (4 triples)
- moidart.com (4 triples)
- texasbestshop.com (3 triples)
- bradsgreenhouse.com (3 triples)
- factanimal.com (3 triples)
- ilperfettocane.com (3 triples)
- e-imoti.com (3 triples)
- kidzfeed.com (3 triples)
- signmaker.gr (3 triples)
- kexuedabaike.com (3 triples)
- podpedia.org (3 triples)
- chew.wiki (3 triples)
- superyachtcuisine.com (2 triples)
- micronations.wiki (2 triples)
- jangala-magazine.com (2 triples)
- biotagroup.org (2 triples)
- avalonplants.com (2 triples)
- thelazypot.com (2 triples)
- siavash-ataee.ir (2 triples)
- amadertangail24.com (2 triples)
- meowsjr.com (2 triples)
- mediafreedom.us (2 triples)
- allglobal.net (2 triples)
- keocopa1.com (2 triples)
- devazen.com (2 triples)
- ivyparadiseplant.com (2 triples)
- westwoodpavillion.com (2 triples)
- pictures-of-cats.org (2 triples)
- veryfood69.com (2 triples)
- thestories.kr (2 triples)
- explore-science-beyond-the-classroom.com (2 triples)
- thucanh.vn (2 triples)
- fertitienda.com (2 triples)
- eczhanem.com (2 triples)
- businesscrystal.com (1 triples)
- audreypuiyan.com (1 triples)
- kekal.id (1 triples)
- namazimedplant.ir (1 triples)
- mariyetanne.com (1 triples)
- horseandman.com (1 triples)
- sogdatacentre.ca (1 triples)
- aikdesigns.com (1 triples)
- bobscentral.com (1 triples)
- officialbangla.com (1 triples)
- dth-offer.com (1 triples)
- contextbusiness.com (1 triples)
- littletreedesignbiotopes.es (1 triples)
- foodsweeteners.com (1 triples)
- ninrio.com (1 triples)
- alanskeoch.ca (1 triples)
- khuzestankhabar.ir (1 triples)
- oracleblog.org (1 triples)
- hepsiburada.com (1 triples)
- breakingnewshubss.com (1 triples)
- learn-barmaga.com (1 triples)
- ardiyansyah.com (1 triples)
- amliebstenreisen.at (1 triples)
- moaragh-simorgh.com (1 triples)
- acamedia.info (1 triples)
- bestinfoz.net (1 triples)
- rashal.com (1 triples)
- scientiaen.com (1 triples)
- tapantareinews.gr (1 triples)
- aimasworld.com (1 triples)
- foroactivo.com (1 triples)
- minyakpelet.web.id (1 triples)
- cnrs.fr (1 triples)
- biyologlar.com (1 triples)
- tjitra.nl (1 triples)
- qds.pt (1 triples)
- myteachinglibrary.com (1 triples)
- fishingproexclusive.com (1 triples)
- uau.ro (1 triples)
- healthguidenet.com (1 triples)
- mybusinessguide.us (1 triples)
- ec2-18-140-62-199.ap-southeast-1.compute.amazonaws.com (1 triples)