martes, 31 de diciembre de 2013

Bitergia - Software analytics for your peace of mind

In previous posts, MetricsGrimoire and FLOSSMetrics were presented to show how already existing Free/Open Source Software Projects already exist in order to obtain metrics and try to evaluate the Quality and State of the Art of software for a particular project, whether it is FLOSS or not.

However, these tools, that are focused on retrieval and storage of the projects on a Database in order to created structured data, do not focus on the analysis, presentation or exploitation of the data.
 For this reason, the company Bitergia, has created a set of tools that enable, on the one hand, an analysis framework, written in GNU-R, called VizGrimoireR, and, on the other hand, a Dashboard-like web presentation of the metrics analysed, through the VizGrimoireJS library. A demo can be observed on this link.




These tools, together with MetricsGrimoire, help on making the circle narrower in order to generate complete reports to help on diagnosing, analysing and improving software projects (Free/Open Source preferably, but not necessarily).

Examples of the kind of reports being generated are, for example:

Mediawiki: http://bitergia.com/public/previews/2012_11_mediawiki/
Webkit: http://bitergia.com/public/reports/webkit/2013_01/index.html
FusionForge: http://bitergia.com/public/previews/2012_11_fusionforge/
OpenStack (Havana Release): http://blog.bitergia.com/2013/10/17/the-openstack-havana-release/
LifeRay: http://blog.bitergia.com/2013/10/23/liferay-and-its-growth-during-the-last-years/

This company, which has still a long way to go, above all in the Private Companies environment, rather than Open Source Projects, is also a good example on how to provide services through Free/Open Source Software, justifying that business models around Open Source are possible to implement.


lunes, 30 de diciembre de 2013

FLOSSmetrics

FLOSSmetrics (Free/Libre and Open Source Software Metrics) was a Project, funded by the European Commission, whose main objective was to retrieve and analyse a big database with information and metrics about Free/Open Source Software projects using already existing methodologies, and relying on already developed tools.

The project achieved the retrieval of more than 2800 FLOSS projects in Melquiades (lamentably down nowadays), by using existing Open Source tools such as CVSAnalY.

Among the most important broadcast activities, a talk in FOSSDEM 2009.

The main objectives of the project were:

- Identification and evaluation of sources of data.
- Creation of a comprehensive database from previous sources.
- Integration of already available tools to process data and create a complete platform for its availability.
- Maintenance of the empirical database by automation of extraction of data of thousands of Open Source projects with extraction tools.
- Develop of studies, (benchmarking, simulations and comparisons of FLOSS projects).
- Propagation and exploitation of the results.

To achieve the objectives, delivery of next Work Packages (WP) was performed:
Among the different documents resultant of the project, FLOSS Guide for SMEs (small and medium enterprises) is a very interesting one, advent to present a set of guidelines and suggestions for the adoption of open source software within SMEs

On 2010, elaboration of a Final Report  concluded this very interesting and fruitful collaboration project.

jueves, 26 de diciembre de 2013

Metrics Grimoire - A toolset to generate software metrics

As time goes by, it has been demonstrated that software metrics are important to categorize the status and health of a particular project, independently of the reason that it is or not an Open Source project.

Metrics Grimoire toolset allows to obtain certain amount of parameters from different kind of sources to generate metrics that enable analyzing a software project.

Among the different tools that the toolset provides, next ones are the most remarkable:

CVSAnalY: Do not be confused because of its name. CVSAnalY project analyzes different kind of repositories (CVS, as well as Subversion and GIT) and stores analyzed information on a SQL database. This information can be later analyzed in order to measure, for example:
- Most active committers.
- Number of commits in a period of time, or a a particular time or date.
- Territoriality of the project, by means of calculation of the different people modifying a particular file.
- And, in general, all the information that can be retrieved from that particular Software Configuration Manager.

Bicho: This tool allows to perform analysis on different Bug Trackers (Bicho means "Bug" in Spanish). The tool enables browsing different kind of Issue Trackers, and store some data from then in an SQL Database. Among the supported Issue Trackers, next ones are found:

- Bugzilla
- Launchpad
- Jira (unstable, partially supported)
- Allura (unstable, partially supported)
- Github (unstable, partially supported)
- SourceForge (abandoned)

The tool allows to obtain metrics such as:
- Number of issues of a particular state.
- Number of issues whose state changed on a particular period of time.
- Top Issue Openers/Resolvers/Closers
- People who have been assigned with a higher amount of Bugs
- And, in general, all the information that can be retrieved from that particular Issue Tracker.

MlStats: This tool allows to retrieve information from a mail list. It works basically with mailman. The tool allows to analyze information such as:
- Number of messages in a day for a particular mail list.
- Top askers and responders.
- % of messages responded.
- And, in general, all the information that can be retrieved from mailman mail list system.

To summarize, a very complete set of tools, which are Free Software (as in Freedom), which are costless, and wich enable to provide analysis of Open Source projects, but also Proprietary software on companies using previous described sources.

sábado, 23 de noviembre de 2013

Motivating community for starting Open Source Projects

Open Source Projects are, indeed, not far from other type of associations or companies where motivation is a key.

Community building is important for already existing projects with strong communities behind.

However, starting Open Source projects are also challenged to show what the project offers, the kind of beneficiaries this project should have, technology , members of the project, and so on and so forth.

I would like to share a nice motivating video on how to "sell" to the community a starting project, in this case, a project hosted by the Linux Foundation, the OpenDayLight Open Source Project [1]:


The video, from my perspective, it is interesting as a good example of community attractive video because of next reasons:

1 - It is concise.
2 - It shows the motivation of the project clearly.
3 - It is "smart" and graphically attractive.

You work as computer engineer or as network engineer. You know Big Data, Cloud Computing and SDN will be the future technologies. Don't you feel a little bit motivated to participate in the community after watching the video?

References:
http://www.youtube.com/watch?v=ZWzMT5KLEtY


Developers Motivations - Why start contributing to Open Source Projects?

There is a huge amount of software developers who always have worked on privative software, basically, for companies that produce software .

However, as an additional task, it is interesting for them to understand why they should start contributing to Open Source projects.

On next video, Jessica McKellar, Open Source advocate, core developer of Python and member of the Python Software Foundation, and maintainer for Twisted and OpenHatch [1],  explains, among other stuff, some clear reasons about why someone should consider, at least, start contributing to an Open Source project [2]:




From Jessica's perspective, contribution to Open Source projects should be performed in order to:

1) Give back to the community. Above all, if you like a particular technology or project, you should consider start contributing to it as a way of restitution.

2) People. As far as she is concerned, smartest people she knows comes from Open Source project movement.

3) Education.  With a Master on Computer Engineering, Jessica recognizes not many lessons being taken on Software Engineering. Contributing to an Open Source project is a way of enforcing knowledge on Software Engineering, as well  as obtaining experience.

4) Work. Open Source contribution is a way of transparency and a "do-ocracy" way of showing someone's capabilities to people and companies hiring.

5) Enjoying. In the end, contributing to a community ends being an enjoyable thing.

The video also shows a complete bunch of topics on FLOSS contribution, as different ways to contribute. Apart from that, Jessica also explains on a very simple way the different technical aspects common to many projects, such as the Bug Issue tracker, the ways of performing changes in code and generating a patch, etc.

References:
[1] http://web.mit.edu/jesstess/www/
[2] http://www.youtube.com/watch?v=IXnNgLmd6BM

domingo, 17 de noviembre de 2013

FLOSS Foundations

Linux Foundation, Free Software Foundation, Apache Foundation, Mozilla Foundation and Eclipse Foundation could be categorized as the "Top Foundations" in terms of popularity. Hosting projects as the Linux Kernel, GNU tools, the Apache Web Server, Mozilla Firefox or the Eclipse Editor.

However, there are more Foundations apart from the previous ones. In fact, there is a whole bunch of Foundations, associated to the most successful Open Source projects. A good place to check them is the Foundations directory of http://flossfoundations.org/, where other important foundations can be found. Among Foundations associated to Open Source Projects, it can be found [2]:

- The Blender Foundation
- Django Software Foundation
- FreeBSD Foundation
- Gentoo Foundation
- GNOME Foundation
- KDE e.V.
- NetBSD Foundation
- OpenBSD Foundation
- The Perl Foundation
- PostgreSQL.US
- Python Foundation
- X.org Foundation

Besides this, we find other very important associations, not associated to a particular project, but to accomplish Open Source and Open-knowledge advocating and promotion. Examples of them are [2]:

- Wikimedia Foundation
- OpenSource Initiative
- Free Software Foundation
- Tex Users Group
- Creative Commons
- OpenDoc Society
- Subversion Corporation

Another category of associations are the ones having to do with regional promotion of FLOSS.  Examples of them [2]:

- Associaçao SoftwareLivre.org (Brasil)
- El Centro de Software Libre (Chile)
- Fundación Via Libre (Argentina)
- Free Software and Open Source Foundation for Africa
- Software Libre Argentina
- Software Livre Brasil
- Software Libre Chile

Some other Foundations are dedicated to promotion of open standards across the Web and Internet. The most remarkable examples are  [2]:

- Internet Software Consortium
- XMPP Standards Foundation
- Xiph.org

Last, but not least, other Foundations have to do with heterogeneous activities, such as Open Health promotion, Digital Rights defence,  Public-Domain works promotion, Social Development, etc. Examples of this kind of Foundations are:

- Electronic Frontier Foundation
- Shuttleworth Foundation
- OpenHealth Tools
- OpenHardware Foundation

In the end, a lot of Foundations appearing on the list.
However, which are the most active? Which of them have greater supporters? Who of them have strong companies behind?
A good parameter for evaluation has to do with revenues. No doubt the most successful project, the Linux kernel, has behind the Foundation with the higher revenues. A good article from Brian Proffitt for ITWorld, showed the revenues of the most important Foundations in the USA [3]:

The Linux Foundation: 2010 Exercise:  Revenues: 9,618,043$
Mozilla Foundation: 2010 Exercise: Revenues: 1,934,659$
Free Software Foundation Inc.: 2009/2010 Exercise: Revenues: 889,400$
Python Software Foundation: 2010 Exercise: Revenues: 706,136$
Gnome Foundation Inc.: 2009/2010 Exercise: Revenues: 636,648$
Apache Software Foundation: 2010/2011 Exercise: Revenues: 539,410$

In the end, the Linux Foundation, surely because of the great companies behind it.

To conclude, just highlighting that, although the numbers of Computer, Technology and Internet industry are huge, no significant amount of money is handled by the most important Foundations (Linux Foundation as exception), taking into account the importance of the projects behind them, as well as the important labour they accomplish.

Reference:
[1] http://flossfoundations.org
[2]  http://flossfoundations.org/foundation-directory
[3] http://www.itworld.com/it-managementstrategy/260688/nonprofit-open-source-organizations-booming

viernes, 1 de noviembre de 2013

Forgotten Names in Open Source - 2013 update

It is not the first time I write about forgotten names in Open Source world. However, I pretend to rewrite the list of forgotten names as time goes by, in order to include more names that I could have previously forgotten, or even that appear in the scene due to the increasing popularity in the actual Open Source scene.

For this reason, this post is not about Richard Stallman, Linus Torvalds, Miguel de Icaza, ... These are  recognized names in Open Source Software environment.

In the "Open Source Hall of Fame", the most known names would be, from my perspective:
- Linus Torvalds: Linux/Git
- Richard Stallman: GNU

Period. These two names are the most known names in the scene., and continue to be, up to date. With a little bit less importance, but still very known, there is a bunch of name including:

- Miguel de Icaza: Gnome/Mono
- Mattias Ettrich: Kde
- Bruce Perens: OSI/Debian
- Eric S. Raymond: OSI/fetchmail
- Donald E. Knuth: Latex
- Theo de Raadt: OpenBSD/OpenSSH
- Mark Shuttleworth: Ubuntu

Meanwhile, on the other hand, there is a much bigger bunch of names that are not as recognized as they should. The aim of this entry is to bring to foreground some of these names, in order to recognize their contribution to the Open Source Community, and also allow readers of this blog to make their opinions of other names that should be included.
Below are, IMHO, the usually names that are not recognized as they should in the Open Source scene:

Robert McCool:
Original author of the NCSA HTTP server, which later turned into the Apache Web Server.

Jamie Zawinski:
American Computer programmer responsible for significant contributions to the free software projects Mozilla and XEmacs, and early versions of the Netscape Navigator web browser. He also maintained the XScreenSaver project

Ian Murdock:
Founder of the Debian distribution, wrote the Debian Manifesto in 1993 while a student at Purdue University.

Brian J. Fox:
Active developer of the FSF. Author of GNU Bash, GNU Makeinfo, GNU Info, GNU Finger, and readline libraries.

Alan Cox:
One of the first Linux kernel testers, Cox discovered and fixed many of existing bugs and went on to rewrite much of the networking subsystem.
He then became one of the main developers and maintainers of the whole kernel.
He has also been involved in the GNOME and X.Org projects.
He left Linux kernel development in January 2013, supposedly due to familiar issues.

Greg Kroah-Hartman:
Linux kernel developer, maintainer of the stable branch, as well as various subsystems such as USB, driver core, debugfs, userspace I/O and TTY layer. It is surely the number 2 in Linux kernel project after Alan Cox has quit from the project.

Larry Wall:
Creator of Perl Programming Language, also known for beeing the developer of the "patch" tool or "rn", the Usenet client.

Guido Van Rossum:
Dutch computer programmer, author of the Python programming language.

Yukihiro Matsumoto:
Japanese computer scientist and software programmer, best known as the chief designer of the Ruby programming language.

Brendan Eich:
Computer programmer and creator of the JavaScript scripting language. He is the chief technology officer at the Mozilla Corporation.

Rasmus Lerdorf:
Danish programmer, creator of the PHP language.

John Chambers - Ross Ihakaand/Robert Gentleman:
Creators of S (Chambers) and R (Ihakaand/Gentleman) statistical computing programming languages.

Mart Otto/Jacob Thornton:
Creators of the Twitter Bootsrap framework, one of the most web frameworks used nowadays for creation of Responsive Web pages and Web applications.

James Strachan/Graeme Rocher:
For Groovy and Grails respectively.

Mark Spencer:
Computer engineer, original author of the GTK instant messaging client Gaim (renamed to Pidgin).
He is also the creator of Asterisk, a Linux-based open-sourced PBX in software, as well as l2tpd and the Cheops Network User Interface.

Michael Tiemann:
Contributor to free software include authorship of the GNU C++ compiler and work on the GNU C compiler and the GNU Debugger.

Daniel Robbins:
Founder and former chief architect of the Gentoo Linux project.

Rob Young:
Robert "Bob" Young is a serial entrepreneur whose biggest success has been Red Hat Inc, the open source software company.

Gaël Duval:
Graduate of the Caen University in France. In July 1998, he created Mandrake Linux (now Mandriva Linux), a Linux distribution originally based on Red Hat Linux and KDE.

Spencer Kimball and Peter Mattis:
They both started development of Gimp in 1995 on Berkeley University, California.

Bill Joy / Chuck Haley:
The original code for vi was written by Bill Joy in 1976, as the visual mode for a line editor called ex that Joy had written with Chuck Haley.

Theodore Y. "Ted" Ts'o:
Is a software developer mainly known for his contributions to the Linux kernel, in particular his contributions to file systems (e2fsprogs, the userspace utilities for the ext2 and ext3 filesystems, and maintainer for the ext4 file system).

Harald Welte:
German programmer, well known as a hacker of the Linux kernel, enforcer of the GNU General Public License (GPL).  Also involved in a number of free software projects, such as Openmoko, (a version of Linux for completely open, low-cost, high-volume phones) and the netfilter/iptables project.

Jörg Schilling:
Computer programmer who has worked extensively on compact disc burning software "cdrtools", the Solaris Operating System and the OpenSolaris project. He has also been involved on BerliOS Linux distribution.

Rob Savoye:
Primary developer of Gnash, he is developer of the GNU Project. He has also worked on Red Hat, Debian, and other libre software projects.
Some of the projects he has worked on include GCC, GDB,[3] DejaGnu, Cygwin, eCos and CTAS.

Árpád Gereöffy:
He started development of MPlayer in 2000.

Fabrice Beillard:
Computer programmer who is best known as the creator of the FFmpeg and QEMU software projects.

Dries Buytaert:
Open-source software programmer notable as founder and lead developer of the Drupal CMS.

Ton Roosendaal:
Dutch software developer, known as the original creator of the open-source 3D creation suite Blender.

Andrew Tridgell:
Australian computer programmer Author of the Samba file server and co-inventor of the rsync algorithm.

Brian E. Paul:
Computer programmer who originally wrote and continues to maintain the source code for the open source Mesa graphics library.

John Gilmore:
Worked on several GNU projects, including maintaining the GNU Debugger in the early 90s, initiating GNU Radio in 1998, starting Gnash in December 2005 to create a free software player for Flash movies, and writing the pdtar program which became GNU tar.

Wietse Zweitze Venema:
Dutch programmer and physicist best known for writing the Postfix email system.

Marco Pesenti:
Main developer of Galeon, Gnome's Browser.

Guenter Bartsch:
Creator of Xine Media Player.

Mark Kretschmann:
Creator of the Amarok audio player.

Roger Dannemberg / Dominic Mazzoni:
Creators of the Audacity free digital audio editor.

Olivier Fourdan:
Creator and main developer of the XFCE desktop.

As conclusion, Free Software / Open Source / Libre Software as general concept, is a wide concept contributed by dozens and dozens of thousands of not only software developers, but also software distribution packagers, activists, translators, documentary people and other roles, that have cooperated to establish one of the most important advances on the last fifty years.

Which names are, from your perspective, missing from the list ?
Do you think there should be attributions to other people ?
Which are, in your opinion, the "forgotten names" ?

domingo, 20 de octubre de 2013

The TIOBE programming languages ranking - October 2013

The TIOBE programming community index is a good thermometer to measure the state of usage of a wide number of programming languages (226 in October 2013) to determine which is the most popular programming language in a particular date.




This comparison, which has been performed since 2006, also shows the programming language ranking on the year before, as well as a graphic to show the progress and the percentage of usage.

Updated once a month, TIOBE ratings index does not define the programming language popularity in terms of lines of code. Indeed, it considers:

- The number of skilled engineers world-wide
- Courses
- Third-party vendors

To do so, the search of each of the programming languages is performed on the most known search engines, as well as on social media websites:

- Google
- Bing
- Yahoo!
- Wikipedia
- Youtube
- Baidu
- etc.

In particular, the index of October 2013 has been split in this basis [2]:

  • Google: 7%
  • YouTube: 7%
  • Baidu: 7%
  • Wikipedia: 7%
  • Amazon: 6%
  • Blogger: 6%
  • Google India: 6%
  • Yahoo! Japan: 5%
  • Bing: 5%
  • Hao123: 5%
  • Google Germany: 5%
  • EBay: 4%
  • Google United Kingdom: 4%
  • Google France: 4%
  • Google Brazil: 3%
  • Google Japan: 3%
  • Google Italy: 3%
  • Google Spain: 3%
  • 360 (so.com): 2%
  • BBC: 2%
  • Google Mexico: 2%
  • Amazon Japan: 1%
  • Google Canada: 1%
  • Alibaba: 1%
  • Huffington Post: 1%
  • Amazon Germany: 1%


Once clarified how TIOBE performs the rankings, it is time to show the TIOBE ranking index for October 2013 [1]:

Position
Oct 2013
Position
Oct 2012
Delta in PositionProgramming LanguageRatings
Oct 2013
Delta
Oct 2012
Status
11C17.246%-2.58%  A
22Java16.107%-1.09%  A
33Objective-C8.992%-0.49%  A
44C++8.664%-0.60%  A
56PHP6.094%+0.43%  A
65C#5.718%-0.81%  A
77(Visual) Basic4.819%-0.30%  A
88Python3.107%-0.79%  A
923Transact-SQL2.621%+2.13%  A
1011JavaScript2.038%+0.78%  A
1118Visual Basic .NET1.933%+1.33%  A
129Perl1.607%-0.52%  A
1310Ruby1.246%-0.56%  A
1414Pascal0.753%-0.09%  A
1517PL/SQL0.730%+0.10%  A
1613Lisp0.725%-0.22%  A
1712Delphi/Object Pascal0.701%-0.40%  A
1853Groovy0.658%+0.53%  B
1919MATLAB0.614%+0.02%  B
2026COBOL0.599%+0.15%  B


Having a glance to the previous table, some conclusions can be take:

- Regarding most used programming languages, there are not much changes in the first places. C, Java, Objective-C (pushed by its use on applications development for IOS) and C++ continue on the first places.

- Meanwhile, PHP has increased in one position (6th to 5th position), changing its position with C#, which decrease from the 5th to the 6th position.

- Basic/Visual Basic as well as Python keep their position (7th and 8th respectively) compared to October 2012 ranking.

- The most strong appearance in the top 10 position occurs with Transact/SQL, which is a proprietary extension to SQL developed by Microsoft and Sybase, appearing on the 9th position, what means an increase of 14 positions compared to October 2012 ranking.

- Javascript increases a position, maybe influenced by the increasing popularity of the JQuery library.

- Visual Basic.NET appears also as one of the most increasing used programming languages, rising from 18th to 11th position.

- Among the loosing popularity languages, some of the languages found are Perl, Ruby, Lisp and Delphi/Object Pascal.

- PL/SQL continues to increase its popularity from 17th to 15th position, seeming not to be influenced by the increase of Transact-SQL.

- Groovy is, obviously, the most rising programming language in the 20th first position, increasing from 53th to 18th position. This object-oriented language, which can be considered a subset of Java that is easier to program and to execute, is increasing its popularity due to the Groovy and Grails web framework, and the strong bid that Spring Software is doing to it.

- Last, but not least, appears MATLAB in the 19th position and COBOL increasing from 26th to 20th position.

Another interesting graphic provided by TIOBE is the top 10 programming languages evolution through last years (2001 to nowadays):



This graphic shows how Java continues its decreasing progress, and how exchanges its position with C on the top of the ranking. It is somehow interesting, that, despite of the success of Android platform, Java does not change its progress, at least from this ranking perspective.

It is also a fact that, the rest of programming languages, apart from Transact-SQL and Objective-C, continue to be somehow stable on their position, and always, all of them far from Java and C.

Last, but not least, TIOBE includes the list of the next 30th most used programming languages [1], with other very popular languages as GNU R, SAS, Ada, Fortran, Bash, ABAP, Tcl, Haskell or ActionScript:

PositionProgramming LanguageRatings
21R0.553%
22SAS0.543%
23Ada0.510%
24F#0.499%
25Fortran0.474%
26Assembly0.471%
27Bash0.470%
28Ladder Logic0.457%
29Logo0.433%
30Lua0.413%
31ABAP0.394%
32C shell0.382%
33Common Lisp0.380%
34NXT-G0.366%
35Scheme0.360%
36Scala0.345%
37D0.337%
38Prolog0.328%
39RPG (OS/400)0.319%
40PostScript0.312%
41JavaFX Script0.297%
42Tcl0.294%
43Erlang0.292%
44Max/MSP0.276%
45Scratch0.270%
46Haskell0.245%
47ML0.245%
48PL/I0.240%
49ActionScript0.215%
50Emacs Lisp0.210%

To conclude, just clarify that TIOBE ranking index is just an index with a particular calculation performed by this company. However, other measures could have been included in order to make this ranking more accurate, such as:

1 - lines of code in the most used Source Forges (Github, Bitbucket, SourceForge, GoogleCode)
2 - LinkedIn programming languages groups
3 - Support mailing lists activities
... and many other.

So, apart from being a good thermometer, this ranking should be observed with certain precautions, and not being considered as the definitive state of art around programming languages being used.

Original source: http://www.tiobe.com
References:
[1] http://www.tiobe.com/index.php/content/paperinfo/tpci/index.html
[2] http://www.tiobe.com/index.php/content/paperinfo/tpci/tpci_definition.htm