The crux of such a distributed project is to keep the commitment needed to contribute under a certain, rather low threshold. Looking at music, this certainly means breaking up pages into single staffs (software might help here), and maybe even introducing various rounds to add information types. Round one concentrates on just getting the notes right, round two adds slurs and other decorations, round three adds text annotations, etc. A final round will be needed to sync everything up, and produce some final output. At this point, you may also have to deal with optimizing the notations for various print / sheet sizes, and produce neat PDFs for easy printing (if not automated in the site code where such things could be generated on-demand from the sources)emeraldimp wrote:[...] The workflow of Mutopia is a single 'typesetter' submitting a completed project, not a distributed group working on the same project. Again, similar to Project Gutenberg before DP.
Also, the project wouldn't have to use the Lilypond format as its storage/output. It could use something else - musicXml, for example - and translate into lilypond for rendering during the proofing, and provide musicXml + ly + whatever as the ultimate output.
Note that with each round, you can render the partial result, to help people decide how accurate they have been.
Lilypond makes some effort to make its notation compact and easy to type, which makes it preferred above such things as musicXml, which makes it easier on computers. However, human time being so much more precious than computer time, we should look at the thing that works best for humans.
Of course, transcribed works cannot replace scans. They can supplement them, as being easier to read and search (especially if a specialized search engine could be made to find certain themes and tunes, even if transposed, etc.)
The distributed aspect is not that the server side is distributed, but that the effort of transcribing is distributed, to as many people as possible. Note that music notation, like math, is complicated, and has a smaller audience than plain English texts, so the throughput will not be 1000s of pages per day, as with PG Distributed Proofreading, which produces about 200 books per month.