Scripting Issue to remove square characters

ve3auo · March 27, 2023, 1:27pm

Hello All,

I think I have found an "undocumented feature" as I always called them when I was very active developing.

I have discovered that many of my files have strange characters in the title field. How they got there is unknown. I am using MP3Tag to try and remove them although I am running into a few issues/blockages with the available scripting options.

The "offending characters" are in the title field. They are at the leftmost position and are only displayed as a small, skinny, vertical, rectangle (see screen caps).

I have tried regex, scripting and whatever else I thought might work from MP3Tag.

In the screen caps, I tried $right(%title%,$len(%title%)-1) as well as $right(%title%,$len(%title%)-10). As you can see in the captures, they both did the same thing. Nothing.
20230327 - Tagging Error - 1

I even tried "filtering" to set the list to display only files with those "unknown" characters using regex and even regex could not reference those chars. It's like they are there but they aren't.

So far, I can only remove them manually and there are a substantial amount. I'd love to be able to automate it.

Clayton

ohrenkino · March 27, 2023, 1:39pm

You could check one of the titles in the extended tags dialogue Alt+T with the edit function and see what these characters are.

Another could be to copy them, paste them as search string into an action of the type "Replace" and replace them with nothing.

And if all that does not help - perhaps you could supply one of these files so that the community could have a closer look.

ohrenkino · March 27, 2023, 1:55pm

This is not the correct syntax.
If you want to subtract a value, you have to use the function $sub(), e.g.
$right(%title%,$sub($len(%title%),10))

ohrenkino · March 27, 2023, 2:03pm

To filter for control characters like 27 or 7 you could use this filter:
"$if($less($ord(%title%),32),1,0)" IS 1

ve3auo · March 27, 2023, 2:31pm

This worked to remove the unknown character. Thank you

ve3auo · March 27, 2023, 2:34pm

"$if($less($ord(%title%),32),1,0)" IS 1

This, I'm guessing, would work IF I knew what the code was for the character.

ohrenkino · March 27, 2023, 2:35pm

No, it would filter for all files where the first letter of TITLE has a code lower than 32 which represents the space character.
It would be so much easier, if you simply tried it.

MotleyG · March 27, 2023, 2:36pm

There was some suggestions mage here about how to see what unicode characters may be hidden in text.

ve3auo · April 7, 2023, 5:09pm

Just an update,

I found the source of these strange characters.

I am using a combination of Picad and MP3Tag to clean up a very large library for a terrestrial and online radio station. They use Station Playlist. Once I finish the tagging, updating and corrections of the files, I run them through the "Track Tool" of SPL. It creates the ReplayGain and the CUE, Seque, etc. points. I also edit the SPL specific fields as needed for classification. It seems when this is done, the strange characters are introduced into various fields.

Time to contact the author of that software and advise him of this happening.

Thanks everyone