

For analysis I grab the logic from the Wappalizer project and programmatically turn that into PHP logic I then grab each top domain's and versions and analyse the first that presents a 200 HTTP code. The previous statistics gathering scripts ran for several days per chart, the new scripts finished the job for all the countries and CMSes in just a couple of hours.Įssentially, I obtain the list of top websites per country from Amazon through their web services, I then strip out all domains not in the ccTLD of the country and proceed with analysis. I'm also storing data YAML format which is the new configuration file storage format for Drupal 8. I'm now running the statistics gathering scripts as PHP CLI scripts that are much faster that the previous Drupal-based solution. It still features the top Drupal websites in South Africa ( ), but now also lists several other countries and top websites charts for Joomla! and WordPress (so we have a list of websites that need to be turned into Drupal websites -).

Some initial countries: UAE, South Africa, International, Japan and the Czech Republic (capital is Prague, where the next DrupalCon will be held). Our new Top Websites site is live (though still in beta). These items may well wait to serve as a great project to learn D8 with over December. Finally, a new theme and exposure of more data, like a migration list over time to see which sites moved to and from Drupal.
#Really cool websites code
The ripper can do with better code and unifying the sites will be nice too. But it would be great to pick a different country every month to generate statistics for in the near future. Other Drupal labors of love in "free" time like this podcast aggregator for Drupal with a unified feed and "real work" will likely hold the ideas up a bit. It ran a few times already to generate the top South African websites for us, the history of which is available here: The site was first put together to get a better idea of the top South African websites that run Drupal as we're in the beginnings of putting together Drupal Association South Africa. One large part of the detection logic of Wappalizer needs a JS run-time environment which is obviously not available to the PHP parser. Many top-end sites prefer to keep such things hidden.

I didn't write additional logic to try to determine if a site is using Drupal when the site takes obvious measures to obfuscate the use of Drupal. The same module also obtains the detection JSON from the Wappalizer project and re-implements it as PHP logic on the fly.
#Really cool websites serial
The top sites list comes from Alexa as sold by Amazon (which owns Alexa) through their Web Services.Ĭurrently the site data is then gathered with a custom module that parses them in serial (needs to be rewritten to be faster with a non-blocking rolling cURL ripper some time in the future). This website lists the current top 100 Drupal websites (with a few limitations).Įdit (): lists the top Drupal websites (in the top 1 million websites worldwide) and it updated weekly.
