Topics

Updated implementation languages data

David A. Wheeler
 

I have added implementation language information to 895 projects that didn't have it before. As discussed before, they're estimates from GitHub, which isn't *always* correct but is decent.

That means that now 1582 / 1857 projects have information in the implementation_languages field (per "select count(*) from projects where implementation_languages LIKE '_%';"). That will make it much easier to do per-language analysis (you could already download the database, but now we have more information in it). For example, I used our sample analysis program "doc/best-practices.py" to show that there are now 66/1862 projects with R listed as an implementation language (adding the implementation language information identified 31 more projects that used R). In the long term we could provide per-language hints to help people get something done. I'm sure there are other uses too; a number of people, for a variety of reasons, want to know the implementation languages used in projects.

We don't currently have a way to search for languages from inside the application search system, though we have issues related to that:
https://github.com/coreinfrastructure/best-practices-badge/issues/1176
https://github.com/coreinfrastructure/best-practices-badge/issues/1209
If someone wants to implement this and make a pull request, I'm all ears...!

--- David A. Wheeler

Georg Link
 

David,

Thanks for the updated. This is great!

Best,
Georg


On Thu, Sep 20, 2018 at 3:13 PM David A. Wheeler <dwheeler@...> wrote:
I have added implementation language information to 895 projects that didn't have it before. As discussed before, they're estimates from GitHub, which isn't *always* correct but is decent.

That means that now 1582 / 1857 projects have information in the implementation_languages field (per "select count(*) from projects where implementation_languages LIKE '_%';").  That will make it much easier to do per-language analysis (you could already download the database, but now we have more information in it). For example, I used our sample analysis program "doc/best-practices.py" to show that there are now 66/1862 projects with R listed as an implementation language (adding the implementation language information identified 31 more projects that used R).  In the long term we could provide per-language hints to help people get something done.  I'm sure there are other uses too; a number of people, for a variety of reasons, want to know the implementation languages used in projects.

We don't currently have a way to search for languages from inside the application search system, though we have issues related to that:
 https://github.com/coreinfrastructure/best-practices-badge/issues/1176
 https://github.com/coreinfrastructure/best-practices-badge/issues/1209
If someone wants to implement this and make a pull request, I'm all ears...!

--- David A. Wheeler