Unable to do simple audio transformations

Post by **SlimyRedstone** » Mon Oct 09, 2023 9:55 pm

I'm trying to write an I2S microphone + headphone example program with basic audio manipulation (e.g. gain, delay, etc.).
I have the simple part working: it reads data from the microphone (an INMP441, 24 bits @ 48 kHz) and sends it to the headphones (through a UDA1334, 24 bits @ 48 kHz).
I'm using ESP-IDF v5.1.1 in VSCode, on the ESP32-S3 N32R8.

The I2S parameters are (this is generated on startup):

Polling interval: 1000ms
Buffer size: 1920 bytes
DMA frame number: 240 bytes
DMA description number: 9 bytes

--- Input ---
Bits per sample: 24
Frame length: 32
Sample rate: 48kHz
Channels: Stereo
Bit shift: Yes
Left align: Yes

--- Output ---
Bits per sample: 24
Frame length: 32
Sample rate: 48kHz
Channels: Stereo
Bit shift: No
Left align: No

However, I don't know how to change the gain. From my research, changing the gain of a signal means multiplying each sample by the gain.
When I do that, I only get noise on the output.
When I don't modify the gain, the output sounds normal and I can hear myself talking into the mic.

Here is a simplified version of the code; I only removed methods and variables not used in this part (tell me if you need more, but this is the part that handles the I2S data).
Note: I also tried the example provided by Espressif, which uses 16-bit I2S, and got the same result.

Code:

#define CONVERT8bits(n,o) (uint8_t)((n>>8*o)&0xFF)
 
double gain = 1.5;
 
void setAudioGain(uint8_t *buffer, size_t bLength, double gain) {
    uint32_t *sample = calloc(2,sizeof(uint32_t)); // L : R
    for (uint32_t sIndex = 0; sIndex < bLength; sIndex += 6 * 4) {
        for (uint8_t j=0;j<6*4;j+=6){
        // Here, Left = 0 and Right = 1
            sample[Left]  = buffer[sIndex + j + 0 ]<<16 | buffer[sIndex + j + 1 ]<<8 | buffer[sIndex + j + 2 ];
            sample[Right] = buffer[sIndex + j + 3 ]<<16 | buffer[sIndex + j + 4 ]<<8 | buffer[sIndex + j + 5 ];
 
            sample[Left]  = (uint32_t)((double)sample[Left]*gain);
            sample[Right] = (uint32_t)((double)sample[Right]*gain);
 
            *((uint8_t *)(buffer + sIndex + j + 0 )) = CONVERT8bits(sample[Left],2);  //  Left Channel MSB
            *((uint8_t *)(buffer + sIndex + j + 1 )) = CONVERT8bits(sample[Left],1);  //  Left Channel
            *((uint8_t *)(buffer + sIndex + j + 2 )) = CONVERT8bits(sample[Left],0);  //  Left Channel LSB
 
            *((uint8_t *)(buffer + sIndex + j + 3 )) = CONVERT8bits(sample[Right],2);  //  Right Channel MSB
            *((uint8_t *)(buffer + sIndex + j + 4 )) = CONVERT8bits(sample[Right],1);  //  Right Channel
            *((uint8_t *)(buffer + sIndex + j + 5 )) = CONVERT8bits(sample[Right],0);  //  Right Channel LSB
        }
    }
    free(sample);
}
 
 
static void audio_task(void *args) {
    uint8_t *data = calloc(BUFFER_SIZE,sizeof(uint8_t));
    audio_init();
    while (true) {
        i2s_channel_read(i2s_rx, data, BUFFER_SIZE, &n_bytes_read, 10e3); // 10 sec max
        setAudioGain(data, BUFFER_SIZE, 1.1); // If I comment this, the sound is normal
        i2s_channel_write(i2s_tx, data, BUFFER_SIZE, &n_bytes_write, 10e3); // 10 sec max
    }
    free(data);
    vTaskDelete(NULL);
}

Post by **ESP_Sprite** » Tue Oct 10, 2023 4:24 am

If anything, your code expects your samples to be big-endian. Is that true?
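To make the question concrete: the posted loop builds each sample as `buffer[i]<<16 | buffer[i+1]<<8 | buffer[i+2]`, which treats the first byte as the most significant one. A minimal sketch of both readings of the same three bytes follows; `load24_be` and `load24_le` are hypothetical helper names used only for illustration, not ESP-IDF functions.

```c
#include <stdint.h>

/* Assemble a 24-bit value from 3 bytes, most significant byte first
 * (this is what the posted loop does). */
static uint32_t load24_be(const uint8_t *p) {
    return ((uint32_t)p[0] << 16) | ((uint32_t)p[1] << 8) | (uint32_t)p[2];
}

/* Same 3 bytes, least significant byte first. */
static uint32_t load24_le(const uint8_t *p) {
    return (uint32_t)p[0] | ((uint32_t)p[1] << 8) | ((uint32_t)p[2] << 16);
}
```

For the bytes `83 17 00`, the first reading gives 0x831700 and the second 0x001783, so picking the wrong order changes the value completely rather than slightly.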

Post by **SlimyRedstone** » Tue Oct 10, 2023 7:10 am

Both I2S ports are configured as little-endian (big_endian set to false) through this parameter.
Do you think this is the source of the problem?
https://docs.espressif.com/projects/esp ... ig_endianE

(extract from the code)

Code:

i2s_std_slot_config_t tx_slot_config = {
    .data_bit_width = I2S_SPK_BITS_PER_SAMPLE,
    .slot_bit_width = I2S_SPK_BITS_PER_CHANNEL,
    .slot_mode = I2S_SLOT_MODE_STEREO,
    .slot_mask = I2S_STD_SLOT_BOTH,
    .ws_width = I2S_SPK_BITS_PER_CHANNEL,
    .ws_pol = true,
    .bit_shift = false,
    .left_align = false,
    .big_endian = false,
    .bit_order_lsb = false,
};
i2s_std_slot_config_t rx_slot_config = {
    .data_bit_width = I2S_MIC_BITS_PER_SAMPLE,
    .slot_bit_width = I2S_MIC_BITS_PER_CHANNEL,
    .slot_mode = I2S_SLOT_MODE_STEREO,
    .slot_mask = I2S_STD_SLOT_BOTH,
    .ws_width = I2S_MIC_BITS_PER_CHANNEL,
    .ws_pol = false,
    .bit_shift = true,
    .left_align = true,
    .big_endian = false,
    .bit_order_lsb = false,
};

Post by **SlimyRedstone** » Tue Oct 10, 2023 5:40 pm

I tried manually converting the samples to little-endian and to big-endian; still noise.

Code:

// @return Returns the first 24 bits
#define as24bit(n) (uint32_t)(n&0xFFFFFF)
 
#define CONVERT32bits(b,o) (uint32_t)(b[o]<<16 | b[o+1]<<8 | b[o+2])
/*
  @param n The number to convert
  @param o Byte offset (0:Right, 1:Middle, 2:Left)
  @return Returns the nth byte from a variable (from right to left)
*/
#define CONVERT8bits(n,o) (uint8_t)((n>>8*o)&0xFF)
 
/*
  @return Transform 0x11223344 into 0x44332211
*/
#define LEtoBE(n) (uint32_t)( CONVERT8bits(n,0)<<24 | CONVERT8bits(n,1)<<16 | CONVERT8bits(n,2)<<8 | CONVERT8bits(n,3) )
 
 
sample[Left] = as24bit( LEtoBE(sample[Left]) );
sample[Right] = as24bit( LEtoBE(sample[Right]));

Post by **MicroController** » Tue Oct 10, 2023 10:26 pm

Plus, the samples are signed, two's complement values; so you have to sign-extend them to int32_t before multiplying.
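A short sketch of why this matters, assuming the 24-bit sample sits in the low 24 bits of a `uint32_t`. The helper names `sign_extend24` and `gain_unsigned24` are made up for this illustration. The second function mimics what the unsigned code in the first post does: it scales the raw bit pattern and keeps 24 bits.

```c
#include <stdint.h>

/* Interpret the low 24 bits of v as a two's complement number.
 * The xor/subtract form avoids shifting a negative signed value. */
static int32_t sign_extend24(uint32_t v) {
    v &= 0xFFFFFFu;
    return (int32_t)(v ^ 0x800000u) - 0x800000;
}

/* What the unsigned approach does: multiply the raw bit pattern,
 * then keep only the low 24 bits. */
static uint32_t gain_unsigned24(uint32_t raw, double gain) {
    return (uint32_t)((double)raw * gain) & 0xFFFFFFu;
}
```

Take the quietest negative sample, -1, stored as 0xFFFFFF. Scaling the raw pattern by 1.1 and truncating to 24 bits yields 0x199998, which reads back as +1677720: a near-silent sample turns into a loud positive one. That kind of jump on every negative sample would be heard as noise.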

Post by **SlimyRedstone** » Wed Oct 11, 2023 12:34 am

MicroController wrote: ↑
Tue Oct 10, 2023 10:26 pm
Plus, the samples are signed, two's complement values; so you have to sign-extend them to int32_t before multiplying.

I also tried converting the values to signed int32_t, either by bit shifting or with map() (to get a more accurate conversion, I guess).
Nothing works, and I don't know why: when I log (printf) every variable before and after each manipulation, the data looks coherent.

(Before treatment)
sample[Left] = 0X831700 (8591104)
sample[Right] = 0X831700 (8591104)

(Applying the gain: 1.1, should slightly amplify the signal)

(After treatment)
sample[Left] = 0X9032E6 (9450214)
sample[Right] = 0X9032E6 (9450214)

(Putting uint32_t into 3 uint8_t *)
[L] buffer[0] = 0X90 (144)
[L] buffer[1] = 0X32 (50)
[L] buffer[2] = 0XE6 (230)

(Putting uint32_t into 3 uint8_t *)
[R] buffer[3] = 0X90 (144)
[R] buffer[4] = 0X32 (50)
[R] buffer[5] = 0XE6 (230)

Post by **ESP_Sprite** » Wed Oct 11, 2023 3:45 am

Erm... you sure the I2S doesn't require 24 bits of sample data in 32-bit variables?
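The parameters in the first post give a hint here: 240 frames × 2 channels × 4 bytes = 1920 bytes, which matches the reported buffer size, whereas 3 bytes per sample would give 320 frames. That is consistent with each sample occupying a 32-bit word, though the thread does not confirm where inside the word the 24 bits sit. The sketch below leaves that open through a `shift` argument; `frames_in_buffer` and `sample_from_word` are hypothetical names.

```c
#include <stddef.h>
#include <stdint.h>

/* Number of frames that fit in a buffer for a given container size. */
static size_t frames_in_buffer(size_t bytes, size_t channels,
                               size_t bytes_per_sample) {
    return bytes / (channels * bytes_per_sample);
}

/* Extract a signed 24-bit sample from a 32-bit word.
 * ASSUMPTION: shift = 8 if the sample occupies the upper 24 bits,
 * shift = 0 if it occupies the lower 24 bits. Check against real data. */
static int32_t sample_from_word(uint32_t word, unsigned shift) {
    uint32_t v = (word >> shift) & 0xFFFFFFu;
    return (int32_t)(v ^ 0x800000u) - 0x800000;
}
```

If the buffer really holds 32-bit words, a loop that steps 3 bytes per sample drifts out of alignment after the first sample, which would explain noise even when each individual conversion looks right in the log.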

Post by **SlimyRedstone** » Wed Oct 11, 2023 11:11 am

ESP_Sprite wrote: ↑
Wed Oct 11, 2023 3:45 am
Erm... you sure the I2S doesn't require 24 bits of sample data in 32-bit variables?

Do you mean using one 32-bit variable instead of three 8-bit ones? I already tried that as well.
In the UDA1334 datasheet (page 10 https://www.nxp.com/docs/en/data-sheet/ ... df#page=10),
in 24-bit mode, the word select signal lasts 32 bit clock cycles (a minimum of 24 and a maximum of 32 are expected, see page 9, section 8.6.1), where the last 24 are the actual I2S data (MSB on the left).
I've just tried setting a 24-bit frame instead of 32; still the same.

Post by **MicroController** » Wed Oct 11, 2023 6:48 pm

(Before treatment)
sample[Left] = 0X831700 (8591104)

That would be a large negative value (-8186112), very close to the maximum amplitude representable in 24bits (-8388608).
Double-check the byte order in the buffer.
And make sure you operate on the signed values. (int32_t signed32 = ((int32_t)(sample24 << 8)) >> 8)
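Working the logged value through: 0x831700 as a signed 24-bit number is 8591104 - 16777216 = -8186112, and -8186112 × 1.1 ≈ -9004723, which lies below the 24-bit minimum of -8388608. So even with correct sign extension, this particular sample cannot be amplified by 1.1 without overflowing. Clamping the result is one common way to handle that; it is an addition here, not something proposed in the thread, and `apply_gain_s24` is a hypothetical name.

```c
#include <stdint.h>

#define S24_MAX  8388607     /* largest signed 24-bit value  */
#define S24_MIN (-8388608)   /* smallest signed 24-bit value */

/* Scale an already sign-extended 24-bit sample and clamp the result
 * to the 24-bit range instead of letting it wrap around. */
static int32_t apply_gain_s24(int32_t sample, double gain) {
    double scaled = (double)sample * gain;
    if (scaled > S24_MAX) return S24_MAX;
    if (scaled < S24_MIN) return S24_MIN;
    return (int32_t)scaled;
}
```

A sample this close to full scale from a microphone in a quiet room is itself suspicious, which is why the byte order is worth re-checking first.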

Post by **SlimyRedstone** » Thu Oct 12, 2023 7:17 pm

MicroController wrote: ↑
Wed Oct 11, 2023 6:48 pm

(Before treatment)
sample[Left] = 0X831700 (8591104)
That would be a large negative value (-8186112), very close to the maximum amplitude representable in 24bits (-8388608).
Double-check the byte order in the buffer.
And make sure you operate on the signed values. (int32_t signed32 = ((int32_t)(sample24 << 8)) >> 8)

Can you write me some basic code that does what you are saying? I'm pretty sure I've already tried that, but I can't get my head around it.
This is what I would write (with your addition):
(Note: I've used macros to make it easier to understand, but if you don't recommend using #define, I won't use them.)

Code:

/*
  @param n The number to convert
  @param o Byte offset (w/ 32bit -> 0:Right most, 3:Left most)
  @return Returns the nth byte from a variable (from right to left)
*/
#define CONVERT8bits(n,o) (uint8_t)((n>>8*o)&0xFF)
 
#define CONVERT32bits(b,o) (int32_t)(b[o]<<16 | b[o+1]<<8 | b[o+2])
 
void setAudioGain(uint8_t *buf, size_t bLength, double gain) {
  int32_t *sample = calloc(2,sizeof(int32_t)); // L : R
  for (uint32_t sIndex = 0; sIndex < bLength; sIndex += 6 * 4) {
    for (uint8_t j=0;j<6*4;j+=6) {
        sample[Left]  = CONVERT32bits(buf, j + 0 );
        sample[Right] = CONVERT32bits(buf, j + 3 );
 
        sample[Left] = ((int32_t)(sample[Left]<<8))>>8;
        sample[Right] = ((int32_t)(sample[Right]<<8))>>8;
 
        sample[Left]  = ((double)sample[Left]*gain);
        sample[Right] = ((double)sample[Right]*gain);
 
      *((uint8_t *)(buf + sIndex + j + 0 )) = CONVERT8bits(sample[Left],2);   //  Left Channel MSB
      *((uint8_t *)(buf + sIndex + j + 1 )) = CONVERT8bits(sample[Left],1);   //  Left Channel
      *((uint8_t *)(buf + sIndex + j + 2 )) = CONVERT8bits(sample[Left],0);   //  Left Channel LSB
 
      *((uint8_t *)(buf + sIndex + j + 3 )) = CONVERT8bits(sample[Right],2);  //  Right Channel MSB
      *((uint8_t *)(buf + sIndex + j + 4 )) = CONVERT8bits(sample[Right],1);  //  Right Channel
      *((uint8_t *)(buf + sIndex + j + 5 )) = CONVERT8bits(sample[Right],0);  //  Right Channel LSB
    }
  }
  free(sample);
}
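One detail in the version above: the reads use `CONVERT32bits(buf, j + 0)` without `sIndex`, while the writes use `buf + sIndex + j`, so every pass reads from the first 24 bytes of the buffer. Below is a hedged sketch of the same idea with matching read and write offsets. It keeps the layout the posted code assumes (3 bytes per sample, most significant byte first), which may not be what the I2S driver actually delivers; the function and helper names are hypothetical.

```c
#include <stddef.h>
#include <stdint.h>

/* Read 3 bytes (MSB first) as a signed 24-bit value. */
static int32_t load_s24_be(const uint8_t *p) {
    uint32_t v = ((uint32_t)p[0] << 16) | ((uint32_t)p[1] << 8) | p[2];
    return (int32_t)(v ^ 0x800000u) - 0x800000;
}

/* Write the low 24 bits of s as 3 bytes (MSB first). */
static void store_s24_be(uint8_t *p, int32_t s) {
    uint32_t v = (uint32_t)s;
    p[0] = (uint8_t)(v >> 16);
    p[1] = (uint8_t)(v >> 8);
    p[2] = (uint8_t)v;
}

/* ASSUMPTION: buf holds packed 3-byte samples. Left and right samples
 * are scaled identically, so a single loop covers both channels. */
void set_audio_gain_packed24(uint8_t *buf, size_t len, double gain) {
    for (size_t i = 0; i + 3 <= len; i += 3) {
        double scaled = (double)load_s24_be(buf + i) * gain;
        if (scaled > 8388607.0)  scaled = 8388607.0;   /* clamp high */
        if (scaled < -8388608.0) scaled = -8388608.0;  /* clamp low  */
        store_s24_be(buf + i, (int32_t)scaled);
    }
}
```

No heap allocation is needed for two temporaries, so the `calloc`/`free` pair is dropped. If the buffer turns out to hold one sample per 32-bit word, the stride and the load/store helpers would need to change accordingly.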
