Concatenating file A with A, first A is silent

No, there wasn't any download link, blue or any other color, but the link repeated directly here seems to work.

I have downloaded it, but have not looked at it yet.

Now I have played it, and unfortunately the result is the same.

But since it works as expected at your end, I have to experiment with different players; they might treat files differently.

I just used the VLC player on Windows on my PC.

Your "dormi4secdormi.mp3" plays in some players but not all, although there is more than 4 seconds of silence.
I believe the problem is that "dormi.mp3" is actually an MP2 file, not MP3.
You can see this in Mp3tag.

I changed the extension of "dormi.mp3" to MP2, and created a 4 second silent MP2 file with the same sample rate and bitrate as "dormi.mp3".
Try concatenating with the new files and see if it plays correctly.
It does for me.
The embedded image is not in the resulting file, so that might require a new command line, or maybe some additional steps in your workflow.
MP2dormi.zip (25.6 KB)
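
For anyone who wants to generate a matching silent file themselves, here is a minimal sketch that builds an ffmpeg command line for it. The helper name `silence_command` is made up for illustration, and the sample rate, bitrate and channel layout in the usage comment are placeholders, not the actual values of "dormi.mp3"; read those from Mp3tag and substitute them.

```python
def silence_command(out_path, seconds, sample_rate, bitrate_kbps, channels="mono"):
    """Build an ffmpeg argument list that encodes silence as MPEG Layer II audio.

    Hypothetical helper: all parameters must be set to match the file
    you intend to concatenate with.
    """
    return [
        "ffmpeg",
        "-f", "lavfi",                                     # use a generated source
        "-i", f"anullsrc=r={sample_rate}:cl={channels}",   # silent audio source
        "-t", str(seconds),                                # duration in seconds
        "-c:a", "mp2",                                     # Layer II encoder
        "-b:a", f"{bitrate_kbps}k",                        # target bitrate
        out_path,
    ]

# Usage (placeholder values, requires ffmpeg on the PATH):
#   import subprocess
#   subprocess.run(silence_command("silence.mp2", 4, 22050, 64), check=True)
```

The function only builds the argument list, so you can print and inspect it before running anything.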

I can confirm that:

ffmpeg seems to create a concatenated MPEG 1 Layer III file.

I did the same thing as you and got the same result: it works as intended, but without the image.

Before that I tried the original concatenated file with several media players. The first one was VLC, with no luck.

BUT, here's the thing: two media players managed to play it as intended, with both dormi parts audible and with the image.

Those players are VSO Media Player and SMPlayer.

What I think those players have in common is that they state no need for external codecs.

For simple concatenation to work, you MUST strip all metadata tags from the files first. They cannot occur within the final audio data. You also should strip the Xing frame if any is present, or the duration will be shown incorrectly. Some players may stop early, others may play past the estimated duration. All media files MUST have the same sampling frequency, number of channels and SHOULD have the same joint stereo mode. Your first example has an embedded picture with a screenshot showing inflection of the word "to sleep".
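
A rough sketch of the tag-stripping step, assuming the only tags present are an ID3v2 tag at the start and an ID3v1 tag at the end. The function names are made up for illustration. It does not handle APE tags or remove a Xing frame, and it does not check that the sampling frequency and channel count match.

```python
def strip_id3(data: bytes) -> bytes:
    """Return the audio data without leading ID3v2 and trailing ID3v1 tags."""
    # ID3v2 header: "ID3", 2 version bytes, 1 flags byte, 4 synchsafe size bytes.
    while len(data) >= 10 and data[:3] == b"ID3":
        size = 0
        for byte in data[6:10]:
            size = (size << 7) | (byte & 0x7F)   # 7 useful bits per byte
        footer = 10 if data[5] & 0x10 else 0     # optional 10-byte footer
        data = data[10 + size + footer:]
    # ID3v1 tag: the last 128 bytes, starting with "TAG".
    if len(data) >= 128 and data[-128:-125] == b"TAG":
        data = data[:-128]
    return data


def concat_audio(parts) -> bytes:
    """Join already-loaded files after stripping their ID3 tags."""
    return b"".join(strip_id3(part) for part in parts)
```

The embedded picture lives inside the ID3v2 tag, so it is lost in this step and has to be added again to the final file afterwards.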

MPEG 2 means a low sampling frequency of 16, 22.05 or 24 kHz. Your silence must match it. MPEG 2 is not the same thing as Layer II (MP2), so you should not use the MP2 extension.
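
To see the difference between the MPEG version and the layer, you can decode the first four bytes of an audio frame. This is only a sketch with a made-up function name; it assumes you pass it bytes that start exactly at a frame header (that is, after any ID3v2 tag).

```python
VERSIONS = {0b00: "MPEG 2.5", 0b10: "MPEG 2", 0b11: "MPEG 1"}
LAYERS = {0b01: "Layer III", 0b10: "Layer II", 0b11: "Layer I"}
SAMPLE_RATES = {
    "MPEG 1": (44100, 48000, 32000),
    "MPEG 2": (22050, 24000, 16000),
    "MPEG 2.5": (11025, 12000, 8000),
}


def parse_frame_header(header: bytes):
    """Return (version, layer, sample_rate) for a 4-byte MPEG audio frame header."""
    # The header starts with 11 set bits (frame sync).
    if len(header) < 4 or header[0] != 0xFF or (header[1] & 0xE0) != 0xE0:
        raise ValueError("no frame sync found")
    version_bits = (header[1] >> 3) & 0b11   # MPEG version
    layer_bits = (header[1] >> 1) & 0b11     # layer, independent of the version
    rate_index = (header[2] >> 2) & 0b11     # index into the sample rate table
    if version_bits not in VERSIONS or layer_bits not in LAYERS or rate_index == 3:
        raise ValueError("reserved value in header")
    version = VERSIONS[version_bits]
    return version, LAYERS[layer_bits], SAMPLE_RATES[version][rate_index]
```

For example, a header of FF F3 90 00 decodes as MPEG 2, Layer III, 22050 Hz: a low sampling frequency file that is still an MP3.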

Concat works only with elementary streams of the following formats: MP2, MP3, AC3, EC3, DTS, AAC.