This will be written in Python, at least for the first cut, just for ease of development -- I think I am at least 2-3 times faster than in C or C++, and bugs may be less severe. I am open to the idea of switching to C at some time in the future, but because that is enormously expensive I want to avoid it until it's clearly necessary. Python is also a good platform to handle cross-platform portability.
Possible reasons to go to C:
That may be useful, if there are substantial sections that approach or meet our goals for both design and implementation (e.g. being good to use from a library.)
This does not necessarily mean doing the whole thing in C; we could call out to tla or could wrap particular bits into libraries.
Erik Bågfors:
However, I think it's very important that a VCS can be wrapped in other languages so that it can be integrated in IDE's and have tools written for them.
A library written in c would be simple to wrap in other languages and therefore could be used from for example monodevelop and friends. I really believe this is important for a VCS.
I agree; this is a more important argument against Python than speed, where I think we can be entirely adequate just using smart design. But there are some partial answers:
We can design bzr to be easily called as an external process -- not depending on interactive input, having systematically parsed output, --format=xml output, etc. This is the only mode CVS supports, and people have built many interesting tools on top of it, and it's still popular for svn and tla. For things like editor integration this is often the easiest way.
Secondly, there is a good chance of calling into Python from other languages. There are projects like Jython, IronPython, Parrot and so on that may well fix this.
Thirdly, we can present a Python library through a C interface; this might seem a bit weird but I think it will work fine. Python is easily embeddable; this might be the best way for Windows IDE integration.
Finally, if none of these work, then we can always recode in C, treating Python only as a prototype.
I think working in Python I can develop it at least twice as fast as in C, particularly in this early phase where the design is still being worked out. Although all other things being equal it might be nice to be in pure C, but I don't think it's worth paying that price.
One of the problems with darcs is that it's such a mess wrapping it.
Yes.
Experiments to date on large trees show that even with little optimization, bzr is mostly disk-bound, and the CPU time usage is only a few seconds. That supports the position that Python performance will be adequate.