Chapter 6. Correcting Data Automatically

Table of Contents

1. Remote Correct
1.1. What are MusicBrainz and AmpliFIND?
1.2. What is Discogs?
1.3. How Does Jaikoz Use MusicBrainz and AmpliFIND?
1.4. Retrieve Acoustic Ids
1.4.1. AmpliFIND Preferences
1.5. Autocorrect Metadata from MusicBrainz
1.5.1. AutoMatch Preferences
1.5.2. Match Preferences
1.5.3. Format
1.5.4. Format 2
1.5.5. Format Preferences
1.6. Autocorrect Metadata from Discogs
1.7. ManualCorrect Tags from MusicBrainz
1.7.1. Manual Match Preferences
1.8. Correct Lyrics
1.9. Update Metadata from Existing MusicBrainz Id
1.10. Update Metadata from Discogs
1.11. Submissions to Musicbrainz
1.11.1. Submit MusicBrainz/PUID Pair
1.11.2. Submit MusicBrainz Genres
1.11.3. Submit Musicbrainz Collection
2. Local Correct
2.1. Correct Artwork
2.2. Delete Duplicates
2.3. Correct Track Nos
2.4. Cross Referencing Correct - Correct Artists/Albums/Titles/Genres/Recording Times/Comments/Composers
2.4.1. How it Works
2.4.2. AutoMatch Preferences
2.4.2.1. Match words that appear misspelt
2.4.2.2. Ignore Word Order when matching
2.4.2.3. Ignore Case when matching
2.4.2.4. Ignore words in this list when matching
2.4.2.5. Split into Words using these values
2.4.3. AutoFormat Preferences
2.4.3.1. Remove whitespace at start or end of value
2.4.3.2. Remove Widespace
2.4.3.3. Remove undisplayable characters
2.4.3.4. Capitalization
2.4.3.5. Replace words by another word
2.4.3.6. Remove these Punctuation Characters
3. File and Folder Correct
3.1. Shift Base Folder to Sub Folder
3.2. Shift Sub Folder to Base Folder
3.3. Correct Tags from Filename
3.3.1. Example
3.3.2. Introduction
3.3.3. How it Works
3.3.4. Split the Filename using these words
3.4. Correct Filenames from Tags
3.5. Correct Sub Folders from Tags
4. Delete Duplicates
5. Auto Correcter

Auto Correction allows your song information to be corrected without you manually entering the information, this is a much quicker and more accurate way of sorting out your music. Jaikoz can perform Local Correct this does not require internet access, and is very quick but can only work with the metadata that you already have in your files. Much more powerful are Autocorrect from Musicbrainz and Autocorrect from Discogs, these use online databases to do comprehensive matches, this is much more accurate but takes longer. File and Folder Correct is another form of Local Correct that modifies your folder and filenames, rather than just your metadata. The Autocorrecter allows you to batch up a number of these tasks into a single task. Usually you would use all these methods to clean up your Music Library

1. Remote Correct

Remote correct groups tasks together that require Internet access in order to run

1.1. What are MusicBrainz and AmpliFIND?

MusicBrainz is an online database of information on more than nine million songs. This is a community based database with contributions by over 200,000 people, with a comprehensive system of moderation ensuring the data is extremely accurate. Additionally many of these songs have associated Acoustic Ids provided by AmpliFIND, allowing a song to be identified by the actual music, so it can do a match even if you have no metadata. AmpliFIND also provides an online database and a number of Music Mixing and Music Identification services, this is the very latest technology and is much more powerful than other systems that only allow a match by metadata.

1.2. What is Discogs?

Discogs is another online database of information similar to Musicbrainz, Discogs also allows you to buy and sell records and CD's to other Discogs users. We use Musicbrainz and Discogs to get the best possible coverage of your song library.

1.3. How Does Jaikoz Use MusicBrainz and AmpliFIND?

Jaikoz allows to lookup your songs by both the acoustic id and the metadata making it very accurate, no decisions have to me made by you whilst the correction is progressing so you can go away and do something else more interesting and review the changes when complete. Jaikoz compares your track acoustically with the AmpliFIND database and if it finds a match it retrieves the acoustic id. It can then be used to contact the MusicBrainz Server to find matching tracks and meta-data. Sometimes multiple tracks may be returned but unlike other systems Jaikoz can use its its AutoMatch algorithm to determine which is the correct match in most cases. It does this by comparing the meta-data in the records returned from MusicBrainz with the record it is trying to match. As a user this means you can run Jaikoz against MusicBrainz and the corrections will be done without any additional intervention required from yourself. This is very useful because the creation of Acoustic ids and lookup in the MusicBrainz Database can take a while if you want to correct many files so is best run unattended for a large number of files. If a match is found Jaikoz will always write a record to the Unique File Identifier which provides a link back to the Music Brainz Website. It also writes values to many of the other fields such as the Music Id (PUID), Artist Id, Album Id, Disc Id, AmazonId, artist, album, title, year, album artist, release status, release type, release country and track number if they do not exist already on your file, or if they are allowed to be overwritten based on your AutoFormat Preferences.

1.4. Retrieve Acoustic Ids

This attempts to find an acoustic match for your track in the AmpliFIND database. In order to get a successful match the song must exist in the AmpliFIND database and your song must be of sufficient quality so that it sounds similar enough to the song in the AmpliFIND database, if an Acoustic Id is found it will be added to the AmpliFIND Id field within your song. Acoustic matching takes a few seconds per track but the Acoustic Id for a track never changes so there is no need to regenerate it, it only has to be done once. Because the Acoustic Id for a particular file will never change, the Acoustic Id will be automatically saved to file as it is retrieved. By default Jaikoz does not submit unknown tracks to AmpliFIND because analysing unknown tracks take significantly longer, but this option can be enabled. When you submit an unknown track to AmpliFIND it will not give you an Acoustic Id immediately, you have to wait 24 hours and then retry retrieving acoustic ids for the unknown track. By this time the track should have been added to the AmpliFIND database and you will normally able to retrieve an acoustic id for it.

1.4.1. AmpliFIND Preferences

You can change the following options the Preferences/MusicBrainz/AmpliFIND window.

The Save Acoustic Ids to file as retrieved save the acoustic id to the file as it is retrieved. This protects you from losing your work against a system crash, because acoustic ids are always correct, no verfication is required by the user

To analyse songs that are not found in the AmpliFIND database use the Analyse and submit tracks that do not currently exist in AmpliFIND database option

You may retrieve acoustic ids more quickly by increasing the Maximum Cpus to use when retrieving Acoustic Ids but by default it is only set to one because there have been some incidences of crashing when using multiple cpus in some scenerios.

When an acoustic id is retrieved from AmpliFIND it can usually provide the artist and the song title as well. You can add these to your songs metadata if empty, always or not at all using the Artist and Title options.

1.5. Autocorrect Metadata from MusicBrainz

This attempts to find a match in the MusicBrainz Database, using metadata such as artist, album and title, track duration and track number and/or the Acoustic Id. Jaikoz groups your songs by album if they have album field specified and/or by folder and first of all try to find a matching release on Musicbrainz for all the songs in the grouping, a sophisticated algorithm analyses potential matches and selects the one that matches all songs the best and is most similar to your preferences including preferred country and media format. If no suitable match is found Jaikoz then does song by song matching based on the metadata, and then for remaining songs it matches based on acoustic ids. Jaikoz will complete as many fields as it can from MusicBrainz if they are enabled in your AutoFormat options.

Because different customers use Jaikoz in different ways there are a number of preferences you can change

1.5.1. AutoMatch Preferences

The Do not match online if already have a MusicBrainz Unique id option allows you to skip over songs that have already had a Unique id, this option is enabled by default.

If the matching Musicbrainz song has a match to a Discogs Release Jaikoz can use Discogs to retrieve additional information for the song, if you do not wish to do this then check Do not update from Discogs when matching

By default Autocorrect from Musicbrainz uses your songs metadata, but if it cannot find a suitable match it will try and do a match based on the songs acoustic id if it it already has one. If Retrieve Acoustic Ids as required to help with matching is enabled then it will try and get acoustic ids for missing songs as required, so with this option enabled you do not need to run the Retrieve Acoustic Ids task seperately, however if you do it would normally make sense to disable this option.

Jaikoz can find the orginal release a song was releasd on, this is most useful when you are trying to match a song on a compilation, but sometimes more searches are required to do this accurately and this can slow down the totasl matching time so is disabled by default, you ca enable it using the Do extra searches to find original releases

1.5.2. Match Preferences

These preferences apply to both matching from Musicbrainz and matching from Discogs

The Prefer do not match to Various Artist Compilations and Prefer do not match to Single Artist Compilations options will give preference to matching a song on an original release such as original album or single, and will only match to a compilation album as a last resort.

The selected Preferred Country of release will give song linked to a releases in this country an improved score.

The selected Preferred Media Formats will give songs with releases of this type an improved score.

If Preferred Release Date is set to Earliest Release Date this will boost scores of releases with the earlier release dates, you can also specify Latest Release Date and No Preference

The Only match complete releases option is useful if you have a collection that already had good metadata and you want to ensure that only really good matches based on your groupings are made.

If your songs are already organised one album per folder you can get better matching if you enable Group Songs by folder only.

1.5.3. Format

This lists most of the values that can possibly be updated by MusicBrainz upon a successful match, this includes text fields and Artwork. By default all fields are populated on a MusicBrainz match if MusicBrainz has a value, but you can select to only Replace if empty or Never Alter the field.

When matching cover art Jaikoz will attempt to finest the highest quality image it can from a variety of sources, the image is added to the audio file automatically but like all tasks the image will not be saved until you save changes.

The Translate foreign artist names to english where possible uses the english version of latin names where possible. this is useful if youre system is English/latin based but you are have some tracks by non lation artists such as Chinese or Japanese which would be difficult to use otherwise.

This is the current list of fields that can be auto populated

  • Artist
  • Album Artist
  • Sort Artist
  • Sort Album Artist
  • Title
  • Year
  • Track No
  • Is Compilation
  • Disc No
  • Original Artist
  • Original Lyricist
  • Language
  • Album
  • MB Disc Id
  • Release Type
  • Release Status
  • Release Country
  • Amazon Id
  • Artwork
  • Composer
  • Disc Total
  • Original Album
  • Original Release Year
  • Script

The Translate foreign artist names to english where possible option can be used to convert artist names containing non latin characters to latin script, this is very useful if you can only understand latin script

The Put Original Year into Year field can be used when the orginal album is found so that the original year is stored in the year field. When creating playlists in some players you may want to create date based playlists i.e Songs from the 60's and only some players provide access to the original release year.

1.5.4. Format 2

This lists additional values that can possibly be updated by MusicBrainz upon a successful match.

  • Label
  • Barcode
  • Cat No
  • Media
  • Lyricist
  • Conductor
  • Producer
  • DJ Mixer
  • Remixer
  • Genre
  • Grouping
  • Release Discogs Url
  • Release Wikipedia Url
  • Release Official Url
  • Sort Composer
  • ISRC
  • Track Total
  • Engineer
  • Mixer

Musicbrainz can store Folksonomy Tags at Artist, Release and Track level. These tags are often (but not always) used to associate an artist, release or track with a musical genre. With Jaikoz you can specify at what level should genre tags be taken from, and how many genres you want to link to any one song. Because most players only understand the first genre linked to a song some customers put a list of genres in the grouping field as a single field to make it easier to create multiple genre based playlists. So with Jaikoz you can also specify whether to write genres to the grouping field as well, independently of the values added to the genre field.

1.5.5. Format Preferences

These preferences apply to both matching from Musicbrainz and matching from Discogs

Add EP and Single release types to release title is useful for distinguishing between a release and a single of the same name, it is common for a release to have the same name as a song on the album and if that song is released as a single there is nothing to differentiate between them.

Sometimes a recording artist may use a different name on some of their releases, this can mke it harder to manage your collection. If you enable Use Artist Name instead of Artist Credit Name then the artist name will always be used in your songs metadata.

Sometimes a recording is given another name on a different release, to force a consistent name to be used enable Use Recording Name instead of Track Name

The different discs of a multi disc release can be determined using the Disc No field. But it is also possible to add the discno to the title by setting Multi Disc Releases to Always add Disc No information to the release title. Discs within a disc can sometimes titles, if you want these to be added to the release title use Add Disc No to the release title if disc has title

1.6. Autocorrect Metadata from Discogs

This attempts to find a match in the Discogs Database, using metadata such as artist, album and title, track duration and track number. Jaikoz groups your songs by album if they have album field specified and/or by folder and first of all try to find a matching release on Discogs for all the songs in the grouping, a sophisticated algorithm analyses potential matches and selects the one that matches all songs the best and is most similar to your preferences including preferred country and media format. If no suitable match is found Jaikoz then does song by song matching based on the metadata. Jaikoz will complete as many fields as it can from Discogs if they are enabled in your AutoFormat options, by default if the song has already been matched to a Musicbrainz release only empty fields will be modified by Discogs

If you are an editor of Musicbrainz you may find the Only match if matches to Musicbrainz release option useful. When this is enabled Autocorrect from Discogs will only try to match to Discogs if the song has a Musicbrainz Id, and will only accept the match if the Discogs metadata is a good match with the existing Musicbrainz metadata. This is a a very useful way of finding links between Musicbrainz and Discogs that you can then add to the Musicbrainz release as a Discogs Relationship Type.

The Match Settings also apply to Discogs

  • Artist
  • Album
  • Album Artist
  • Release Type
  • Release Status
  • Title
  • Year
  • Track No
  • Disc No
  • Is Compilation
  • Release Country
  • Conductor
  • Composer
  • Label
  • Barcode
  • Cat No
  • ISRC
  • Media
  • Lyricist
  • Track Total
  • Disc Total
  • Remixer
  • Artwork

Discogs categorises releases by styles and sub categories called genres. With Jaikoz you can complete the genre field using styles and/or genres, and select how many genres you want to link to any one song. Because most players only understand the first genre linked to a song some customers put a list of genres in the grouping field as a single field to make it easier to create multiple genre based playlists. So with Jaikoz you can also specify whether to write genres to the grouping field as well, independently of the values added to the genre field.

1.7. ManualCorrect Tags from MusicBrainz

For each (selected) track this finds upto ten potential matches in the MusicBrainz Database, a similar algorithm is used as for the AutoCorrect from Musicbrainz but is track based instead of release based , and instead of automatically selecting the best match, upto ten matches are displayed and it is your decision to select a match or not. Matches by Acoustic Id are shown first, then matches by meta data - sorted by their rating. Each choice is displayed on a separate row for each release event so that you can select the Release Date, Record Label and Catalog No to select when there are multiple choices. You can modify what fields are displayed and in what order and these changes are preserved. The last column contains either Search for the master record or View for the choices, if you select View Jaikoz opens the release in Musicbrainz so that you can look in more detail. If you select Search it opens Musicbrainz Search you can then select one of the results or search again to find the track that you want, then select the tagger button on the webpage and Jaikoz will use this selection.

If you select the popup menu for a song you have some further options. Match Songs to Selected Release looks at the Release Id of the selected song, it then checks all the other potential matches for the other songs and selects any match that has the same Release Id. This is a nice easy way of matching a whole album of songs just by finding the best match for one song. View Track in Musicbrainz and View Release in Musicbrainz open the selected match in Musicbrainz so that you can check al the details.

By default records are processed in batches of ten, after the first ten songs are processed the matches are displayed in a dialog. Whilst you are reviewing the options the next batch of songs is processed in the background. If you select Ok the songs are updated to your matches. This continues until there there are no more records to process, unless you select Cancel. You can select Reset to undo any changes you have made in the current batch that you are reviewing.

1.7.1. Manual Match Preferences

The Number of records to process before showing dialog sets how many records are processed before displaying the potential matches in a dialog when running ManualCorrect Tags from MusicBrainz.

1.8. Correct Lyrics

Jaikoz can retrieve the lyrics for many of your songs using an online database.

1.9. Update Metadata from Existing MusicBrainz Id

For each (selected) song this checks that they already have a MusicBrainz Id, and if they do retrieve the latest information from MusicBrainz for this Id and updates the relevent fields in the record if they are enabled in your Format options. If the track does not have a MusicBrainz Id but it does have a Musicbrainz Release Id then Jaikoz tries to find the correct track on the release by comparing the trackname, track number and track duration. If Musicbrainz contains a link to a Discogs Release Id the songs will also be updated with any more information available from Discogs.

This has a number of uses:

  • Ensuring that you have the latest information from MusicBrainz because MusicBrainz may have improved/corrected/increased the information held for this track since you originally matched it.

  • Jaikoz may not have been able to find a MusicBrainz Match itself, so you can find the correct track using the MusicBrainz Website yourself and then enter the id into the UniqueId field, then use this task to update all the fields based on this Id.

  • Even easier, just find the Release Id and then paste the Musicbrainz Release Id for every track into the Release Id field

1.10. Update Metadata from Discogs

For each (selected) song this checks that they already have a Release Discogs Url, and if they do then Jaikoz retrieves the latest information from Discogs for this Release Id and tries to find the correct track on the release by comparing the trackname, track number and track duration of the tracks with the Discogs release information. Jaikoz then updates the relevent fields in the record if they are enabled in your Format options. The Release Discogs Url can be added after a successful Musicbrainz lookup but if you're song does not have one you can search on Discogs for it yourself and add it manually.

Whether or not you have already tagged your track from Musicbrainz, Discogs may contain additional information that is not included in the Musicbrainz Database. You can also use this to help identify information that you can be added into Musicbrainz. Discogs Cover Art is generally higher quality than is available from Musicbrainz.

1.11. Submissions to Musicbrainz

The MusicBrainz Server field specifies what MusicBrainz server to use, currently there is only one but it is expected there will be mirror servers available in the future.

In order to submit data to MusicBrainz you need to have a user account, these are completely free and can be created at the MusicBrainz Website. If you have a user account specify your username and password in this tab in the MusicBrainz Username and MusicBrainz password fields. By submitting data to Musicbrainz you are helping to improve this open source database and maintain the most accurate music resource in existence. By submitting information for songs that you own you create an archive of information that will never be lost even if your own files get damaged. Currently the majority of information that is contributed to Musicbrainz is via their website, but you can contribute some information automatically via Jaikoz. if you do this please ensure the information you submit is accurate and of a high quality.

1.11.1. Submit MusicBrainz/PUID Pair

If you successfully create an Acoustic Id (AmpliFIND PUID) for a track and you have successfully found a match on MusicBrainz, and your track has a Music Brainz Unique File Id, you can submit the pairing to MusicBrainz the database. You should only submit a pairing if you have confirmed that the pairing has been correctly and identified and you should not submit the same pairing more than once.

1.11.2. Submit MusicBrainz Genres

Musicbrainz has a Folksonomy cloud that can be used to tag artists, releases and tracks with anything you like. Jaikoz uses this to add genres at releases and track level. Because genres are subjective this feature is going to become more useful as more people submit their genres, so please give it a go. Jaikoz also uses the folksonomy cloud to fix the genre field when correcting songs from Musicbrainz.

1.11.3. Submit Musicbrainz Collection

Musicbrainz can store a list of the release you own and inform you of new releases by your favourite artists. Submitting releases can be done via the website but Submit Musicbrainz Collection provides an easier way to submit all the releases loaded in Jaikoz in one go.