Top Domains by Extracted Triples for Extractor Microformats species


Back to Statistics

This page contains the list of top domains using mf-species of the extraction of October 2023 of the Web Data Commons project.The page shows the top domains employing Microformats species within their websites, ordered by the number of triples found in the crawl corpus.


  1. wikipedia.org (321,284 triples)
  2. wikibooks.org (34,902 triples)
  3. antwiki.org (24,575 triples)
  4. coastalplainplants.org (13,691 triples)
  5. blogspot.com (8,872 triples)
  6. kiddle.co (8,109 triples)
  7. wikidoc.org (6,346 triples)
  8. wmflabs.org (3,386 triples)
  9. wiktionary.org (3,224 triples)
  10. altwiki.org (2,914 triples)
  11. wikimedia.org (2,893 triples)
  12. en-academic.com (2,853 triples)
  13. handwiki.org (2,630 triples)
  14. plantdollar.com (2,426 triples)
  15. amanitaresearch.com (1,499 triples)
  16. abcdef.wiki (1,192 triples)
  17. weblio.jp (1,044 triples)
  18. db0nus869y26v.cloudfront.net (967 triples)
  19. mdwiki.org (741 triples)
  20. atptree.com (614 triples)
  21. findatwiki.com (541 triples)
  22. wikichiro.org (451 triples)
  23. umpan.com.my (431 triples)
  24. sinoxnursery.com (429 triples)
  25. wikiwand.com (374 triples)
  26. privacytools.io (340 triples)
  27. marinemammalscience.org (334 triples)
  28. cloudflare-ipfs.com (306 triples)
  29. detailedpedia.com (279 triples)
  30. marefa.org (248 triples)
  31. tanijaya.com (224 triples)
  32. partcommunity.com (217 triples)
  33. bharatpedia.org (203 triples)
  34. 654.pl (184 triples)
  35. wikipredia.net (167 triples)
  36. iiab.me (135 triples)
  37. herodictionary.com (131 triples)
  38. evergreen.edu (131 triples)
  39. alquds.edu (127 triples)
  40. besserwiki.de (118 triples)
  41. kehati.or.id (117 triples)
  42. es-academic.com (101 triples)
  43. snaturou2000.sk (100 triples)
  44. ipfs.io (95 triples)
  45. treviambiente.it (90 triples)
  46. planetfish.org (84 triples)
  47. eagle-rock.org (81 triples)
  48. definify.com (76 triples)
  49. idwikipedia.org (75 triples)
  50. dempseynurseries.com (74 triples)
  51. enwikipedia.net (73 triples)
  52. jfoakes.com (72 triples)
  53. conworkshop.com (72 triples)
  54. bourndaeec.nsw.edu.au (70 triples)
  55. ikekoi.fr (66 triples)
  56. nv-os.org (56 triples)
  57. kfd.me (56 triples)
  58. wikizero.com (53 triples)
  59. chalochatu.org (51 triples)
  60. possumliving.com (45 triples)
  61. bigfootcasts.com (44 triples)
  62. casplantje.nl (38 triples)
  63. mindquota.com (38 triples)
  64. msu.edu (34 triples)
  65. copro.com.ar (34 triples)
  66. techsciencenews.com (34 triples)
  67. majutani.com (34 triples)
  68. meddic.jp (32 triples)
  69. cavallidacorsa.org (32 triples)
  70. karaitivu.org (32 triples)
  71. thewinehacker.com (30 triples)
  72. wikizero.org (30 triples)
  73. ant-photo.eu (28 triples)
  74. benakhati.com (28 triples)
  75. iwherbalzworld.com (27 triples)
  76. universityhealthcenter.in (26 triples)
  77. knowpia.com (26 triples)
  78. indahcraft.net (26 triples)
  79. farwellfruitfarm.com (25 triples)
  80. wikipediaforschools.org (25 triples)
  81. infogalactic.com (25 triples)
  82. seed.ir (24 triples)
  83. teamofmonkeys.com (23 triples)
  84. worldchampionshipcoyotecallingcontest.com (23 triples)
  85. hailcincinnati.com (23 triples)
  86. neuroinf.jp (23 triples)
  87. conworlds.org (23 triples)
  88. classicistranieri.com (23 triples)
  89. dailygaggle.com (23 triples)
  90. catfishconference.com (23 triples)
  91. squper.com (23 triples)
  92. fishkeepingforever.com (22 triples)
  93. dmfarm.it (22 triples)
  94. weblaboratorium.hu (22 triples)
  95. mrowl.com (20 triples)
  96. encyclopediaofastrobiology.org (20 triples)
  97. urbanpestis.com (19 triples)
  98. hyperlinked.wiki (19 triples)
  99. thebiofiles.com (18 triples)
  100. webot.org (17 triples)
  101. pihattcoffee.com (17 triples)
  102. greengoldghana.com (17 triples)
  103. wazji.pl (17 triples)
  104. naturescapesofbeaufort.com (17 triples)
  105. kachaf.com (17 triples)
  106. arabsciencepedia.org (17 triples)
  107. sw-em.com (17 triples)
  108. thcscience.wiki (17 triples)
  109. mate-tea.net (17 triples)
  110. thekitchenplayground.com (17 triples)
  111. wikinfo.org (17 triples)
  112. wiki2.org (16 triples)
  113. theplantlady.com (16 triples)
  114. jamesdickfineart.com (16 triples)
  115. usmantis.com (16 triples)
  116. plantscapedubai.com (16 triples)
  117. aaichisavali.com (16 triples)
  118. animalstime.com (16 triples)
  119. wikizero.net (15 triples)
  120. wanweibaike.net (15 triples)
  121. niaoleiba.com (15 triples)
  122. birdmanspetsource.com (15 triples)
  123. stekom.ac.id (14 triples)
  124. seashellshop.com (14 triples)
  125. infoanew.com (13 triples)
  126. faunadanflora.com (13 triples)
  127. fijibutterflyfishcount.com (13 triples)
  128. sbiras.cz (13 triples)
  129. omniversalis.org (13 triples)
  130. vollmedica.eu (13 triples)
  131. digiprotein.ir (12 triples)
  132. koumtsidis.gr (12 triples)
  133. slipfox.xyz (11 triples)
  134. qiuwenbaike.cn (11 triples)
  135. ecofood.hk (10 triples)
  136. profilbaru.com (10 triples)
  137. originalpeople.org (10 triples)
  138. fotoartbook.com (10 triples)
  139. wildroots.in (10 triples)
  140. silichip.org (10 triples)
  141. biota.pt (10 triples)
  142. uncyclopedia.co (9 triples)
  143. atozwiki.com (9 triples)
  144. cannaqa.wiki (8 triples)
  145. profillengkap.com (8 triples)
  146. wikizand.com (8 triples)
  147. gabitos.com (8 triples)
  148. satriahewan.com (8 triples)
  149. wiki.edu.vn (7 triples)
  150. profilpelajar.com (7 triples)
  151. wikipedia-on-ipfs.org (6 triples)
  152. eymaps.com (6 triples)
  153. bildiris.com (6 triples)
  154. histo.cat (6 triples)
  155. yoda.wiki (6 triples)
  156. westernrivers.org (6 triples)
  157. wikigerman.edu.vn (5 triples)
  158. tazintosh.com (5 triples)
  159. revivalmushroom.com (5 triples)
  160. themagictruffleshop.com (5 triples)
  161. happyrockpets.com (5 triples)
  162. drtharangawickramasooriya.com (5 triples)
  163. hiagro.com (5 triples)
  164. prfrp.org (5 triples)
  165. galaxy-vn.com (5 triples)
  166. wikipedia.su (5 triples)
  167. tldrify.com (5 triples)
  168. mafiacorruption.pl (5 triples)
  169. tojsiab.com (5 triples)
  170. ralfschepp.de (5 triples)
  171. andishehstars.com (5 triples)
  172. uncyclopedia.com (5 triples)
  173. montagneaperte.it (5 triples)
  174. wikirank.net (5 triples)
  175. laketoba.net (5 triples)
  176. rc-org.com (4 triples)
  177. organicfooda.com (4 triples)
  178. sunitjotravel.com (4 triples)
  179. hanauma.org (4 triples)
  180. selfstudyanthro.com (4 triples)
  181. brihaat.com (4 triples)
  182. bingj.com (4 triples)
  183. aimasworld.in (4 triples)
  184. pooyesh-dar-kardarmani-karaj.ir (4 triples)
  185. qudswiki.org (4 triples)
  186. whyevolutionistrue.com (4 triples)
  187. educationgo.co.in (4 triples)
  188. limswiki.org (4 triples)
  189. moidart.com (4 triples)
  190. texasbestshop.com (3 triples)
  191. bradsgreenhouse.com (3 triples)
  192. factanimal.com (3 triples)
  193. ilperfettocane.com (3 triples)
  194. e-imoti.com (3 triples)
  195. kidzfeed.com (3 triples)
  196. signmaker.gr (3 triples)
  197. kexuedabaike.com (3 triples)
  198. podpedia.org (3 triples)
  199. chew.wiki (3 triples)
  200. superyachtcuisine.com (2 triples)
  201. micronations.wiki (2 triples)
  202. jangala-magazine.com (2 triples)
  203. biotagroup.org (2 triples)
  204. avalonplants.com (2 triples)
  205. thelazypot.com (2 triples)
  206. siavash-ataee.ir (2 triples)
  207. amadertangail24.com (2 triples)
  208. meowsjr.com (2 triples)
  209. mediafreedom.us (2 triples)
  210. allglobal.net (2 triples)
  211. keocopa1.com (2 triples)
  212. devazen.com (2 triples)
  213. ivyparadiseplant.com (2 triples)
  214. westwoodpavillion.com (2 triples)
  215. pictures-of-cats.org (2 triples)
  216. veryfood69.com (2 triples)
  217. thestories.kr (2 triples)
  218. explore-science-beyond-the-classroom.com (2 triples)
  219. thucanh.vn (2 triples)
  220. fertitienda.com (2 triples)
  221. eczhanem.com (2 triples)
  222. businesscrystal.com (1 triples)
  223. audreypuiyan.com (1 triples)
  224. kekal.id (1 triples)
  225. namazimedplant.ir (1 triples)
  226. mariyetanne.com (1 triples)
  227. horseandman.com (1 triples)
  228. sogdatacentre.ca (1 triples)
  229. aikdesigns.com (1 triples)
  230. bobscentral.com (1 triples)
  231. officialbangla.com (1 triples)
  232. dth-offer.com (1 triples)
  233. contextbusiness.com (1 triples)
  234. littletreedesignbiotopes.es (1 triples)
  235. foodsweeteners.com (1 triples)
  236. ninrio.com (1 triples)
  237. alanskeoch.ca (1 triples)
  238. khuzestankhabar.ir (1 triples)
  239. oracleblog.org (1 triples)
  240. hepsiburada.com (1 triples)
  241. breakingnewshubss.com (1 triples)
  242. learn-barmaga.com (1 triples)
  243. ardiyansyah.com (1 triples)
  244. amliebstenreisen.at (1 triples)
  245. moaragh-simorgh.com (1 triples)
  246. acamedia.info (1 triples)
  247. bestinfoz.net (1 triples)
  248. rashal.com (1 triples)
  249. scientiaen.com (1 triples)
  250. tapantareinews.gr (1 triples)
  251. aimasworld.com (1 triples)
  252. foroactivo.com (1 triples)
  253. minyakpelet.web.id (1 triples)
  254. cnrs.fr (1 triples)
  255. biyologlar.com (1 triples)
  256. tjitra.nl (1 triples)
  257. qds.pt (1 triples)
  258. myteachinglibrary.com (1 triples)
  259. fishingproexclusive.com (1 triples)
  260. uau.ro (1 triples)
  261. healthguidenet.com (1 triples)
  262. mybusinessguide.us (1 triples)
  263. ec2-18-140-62-199.ap-southeast-1.compute.amazonaws.com (1 triples)