Before I go and spend time programming my own, is anyone aware of a free program that can take images from a score (say from an extracted score pdf on this site), and attempt to split it into the different parts?
I don't necessarily need to digitize it, though that would also lead to a solution too of course. It seems something a good quality composition program would be able to do (scan a score and digitize it), though I wouldn't know if they could do it from image (no technical reason why not), but I'm not very familiar with any of those tools, or their costs.
Many thanks.
Score splitter program
Moderator: kcleung
-
- Site Admin
- Posts: 1139
- Joined: Sun Jan 14, 2007 8:16 am
- notabot: YES
- notabot2: Bot
- Location: Perth, Australia
- Contact:
-
- Groundskeeper
- Posts: 1445
- Joined: Sun Oct 05, 2008 3:01 pm
- notabot: YES
- notabot2: Bot
- Location: U.S.A.
- Contact:
On a related subject, are there any pdf-splitting (I.E. individual pages) free programs for linux? It would seem that they all cost a lot, and I just dont like to pay for something that should be free.
BTW, I tried PDFsam, and it didn't work - I don't think it's intended for large files like the one I need to split (BGA 44)
BTW, I tried PDFsam, and it didn't work - I don't think it's intended for large files like the one I need to split (BGA 44)
Formerly known as "perlnerd666"
Re: Vivaldi -- Yeah, that was what I wanted to avoid. Or rather, automate. It doesn't seem like it would be too hard to write something that would do that process to close enough accuracy (or also to reduce hint time for misses). Just wanted to see if someone has already written it before I re-engineer the wheel.
-
- active poster
- Posts: 293
- Joined: Sun Apr 23, 2006 5:08 am
- notabot: YES
- notabot2: Bot
- Location: Phoenix, AZ
Look up pdftk. It's based on the Java pdf library iText. So you could write a program using this library to do it also. The clunkiest option is using ghostscript. In environments that don't allow 3rd party tools (i.e. my work) I've had to make a script which splits the pdf using ghostscript. It's not pretty, but it'd doable.On a related subject, are there any pdf-splitting (I.E. individual pages) free programs for linux?
This isn't an easy problem. Many scores omit parts in systems where they just rest. The same part will jump around from system to system. I've also seen horn parts switch the pairing. For one system it will be I+II, III+IV then for the next it will be I, II+III+IV. Many times full scores omit the short instrument names at the beginning of the system. The instruments are determined from context.Before I go and spend time programming my own, is anyone aware of a free program that can take images from a score (say from an extracted score pdf on this site), and attempt to split it into the different parts?
I don't think there are any programs out there currently which try to solve this problem. I say go for it. Let us know if you come up with anything. Even if the input was the score and a list of staves to pick out of each system it would still be immensely easier than using an image editor.
I will guiltily admit that I am a string player, and will use this mostly for splitting quartet scores where most of aforementioned problems are rare. That being said, odds are a lot of good groundwork for handling more complex problems will fall out. Human in the loop capability needs to be there anyway for correction to the algorithm. Time will tell. If anyone else knows anything, I'll still watch this thread, but I'll start playing around this week if I find time. If/when I come up with anything useful, I'll post a new thread.
-
- Groundskeeper
- Posts: 1445
- Joined: Sun Oct 05, 2008 3:01 pm
- notabot: YES
- notabot2: Bot
- Location: U.S.A.
- Contact:
I suck at ghostscript. Period. And Pdftk doesn't install (I'm missing some packages that need some packages that need the first packages)Look up pdftk. It's based on the Java pdf library iText. So you could write a program using this library to do it also. The clunkiest option is using ghostscript. In environments that don't allow 3rd party tools (i.e. my work) I've had to make a script which splits the pdf using ghostscript. It's not pretty, but it'd doable.
Thanks anyways!
I'll Try just selecting and saving manually... That might work....[/quote]
-
- active poster
- Posts: 293
- Joined: Sun Apr 23, 2006 5:08 am
- notabot: YES
- notabot2: Bot
- Location: Phoenix, AZ
Here's another solution (really pdftk is the best option however if you can make it work): pdfimages. It spits out a pnm for each page so you lose page size and dpi information. If you know that (or don't care to lose it) you can then use imagemagick (optionally with tiffcp + tiff2pdf) to get a pdf again.I suck at ghostscript. Period. And Pdftk doesn't install (I'm missing some packages that need some packages that need the first packages)
Thanks anyways!
I'll Try just selecting and saving manually... That might work....