HF Digital Voice Modulation modes

Sent from ProtonMail mobile

-------- Original Message --------
On Oct 18, 2021, 2:49 AM, Tom, wb6b < wb6b@...> wrote:

On Sun, Oct 17, 2021 at 11:04 PM, Jerry Gaffke wrote:

Hans has a nice 50W class C PA for the QCX if you wanted
Hi Jerry,

That is a good idea. It is so easy to only think of transceivers now days. A separate transmitter and using the uBitx, or even a SDR, as the receiver is a good alternative.?

If I did modify the uBitx, I'd try to do it in such a way as to not mess up the bias for SSB, I'd use something as simple as a relay to switch the RF source and bias when trying the digital modes.?

I just made a trip to my museum of unfinished projects. Found a box with a really nice Turkish stamp of a very happy little girl operating a soldering iron on it, and found a 10W HF amplifier kit within. Along with some low pass filter kits. This could be a good start. I image I could convert it to class C without much trouble.?

Tom, wb6b

#90577

On Sun, Oct 17, 2021 at 10:24 AM, Jerry Gaffke wrote:

We can do more with a single carrier, and the FreeDV codec has advanced considerably since then

Jerry,

I thought that FreeDV was a multiple carrier mode and would require a linear amp.? Is there a version that is single carrier mode?

73
Evan
AC9TU

#90578

On Mon, Oct 18, 2021 at 05:18 AM, Evan Hand wrote:

I thought that FreeDV was a multiple carrier mode and would require a linear amp.

I was proposing to use the FreeDV codec but use the GMSK modulation type. Or another FSK type it it is better.?

I don't think there's any reason not to mix and match codecs (voice encoders) and modulation schemes.

Tom, wb6b

#90579

As I understand it, GMSK as used by D-Star is just two tones.??
In GMSK, the transition from one tone to the other is made as soft as possible
to reduce the bandwidth of the resultant signal, the key click filter on a?
CW transmitter does something similar.

Looking around the web, I don't see much talk about D-Star modulation for HF.
My guess is that anybody using D-Star on HF just uses the same modulation scheme
as on VHF, ignoring the FCC's 300 baud max symbol rate.? ? Does anybody know for sure?
If AM phone is still legal, I suppose they would argue that a 6khz wide D-Star
transmission should be too.? Or perhaps I am wrong about the 300 baud max.

Jerry, KE7ER

Show quoted text

On Mon, Oct 18, 2021 at 05:38 AM, Tom, wb6b wrote:

On Mon, Oct 18, 2021 at 05:18 AM, Evan Hand wrote:

I thought that FreeDV was a multiple carrier mode and would require a linear amp.
I was proposing to use the FreeDV codec but use the GMSK modulation type. Or another FSK type it it is better.?

I don't think there's any reason not to mix and match codecs (voice encoders) and modulation schemes.

Tom, wb6b

#90580

Gmsk is really very simple.

First, it picks two frequencies that the carrier switches between. Second, the frequency is switched preciselt where the signal hits the zero voltage. Thus, it avoids the abrupt ending of one frequency waveform and start of the next.

- f

Show quoted text

On Mon, Oct 18, 2021, 9:15 PM Jerry Gaffke via <jgaffke=[email protected]> wrote:

As I understand it, GMSK as used by D-Star is just two tones.??
In GMSK, the transition from one tone to the other is made as soft as possible
to reduce the bandwidth of the resultant signal, the key click filter on a?
CW transmitter does something similar.

Looking around the web, I don't see much talk about D-Star modulation for HF.
My guess is that anybody using D-Star on HF just uses the same modulation scheme
as on VHF, ignoring the FCC's 300 baud max symbol rate.? ? Does anybody know for sure?
If AM phone is still legal, I suppose they would argue that a 6khz wide D-Star
transmission should be too.? Or perhaps I am wrong about the 300 baud max.

Jerry, KE7ER

On Mon, Oct 18, 2021 at 05:38 AM, Tom, wb6b wrote:

On Mon, Oct 18, 2021 at 05:18 AM, Evan Hand wrote:

I thought that FreeDV was a multiple carrier mode and would require a linear amp.
I was proposing to use the FreeDV codec but use the GMSK modulation type. Or another FSK type it it is better.?

I don't think there's any reason not to mix and match codecs (voice encoders) and modulation schemes.

Tom, wb6b

#90583

That's a good description of MSK, but there is a second order discontinuity when it switches tones.? The slope of the curve makes an abrupt change there.? GMSK makes the transition more gradual, but still centers the transition on the zero crossing like MSK.
??
A modulation scheme such as Olivia can choose from dozens of different tones sent one at a time, not just two different tones.? With enough different tones to choose from, you can probably have enough bandwidth to meet the needs of Codec2 voice transmissions and still meet the US FCC's 300 baud max symbol rate.? But I'm guessing the total bandwidth occupied would not be much less than a two tone signal switched at a fast enough rate to give the same bandwidth.? Switching between many different tones can probably be done using the GMSK method, but all the basic descriptions of GMSK I've seen so far only tell us about the two tone version.

Most Codec2 transmissions on HF take it a step further, and send multiple tones simultaneously to get even more bandwidth out of that 300 baud max symbol rate.? These transmissions must go through a linear amplifier.

Jerry, KE7ER

Show quoted text

On Mon, Oct 18, 2021 at 08:59 AM, Ashhar Farhan wrote:

Gmsk is really very simple.
First, it picks two frequencies that the carrier switches between. Second, the frequency is switched preciselt where the signal hits the zero voltage. Thus, it avoids the abrupt ending of one frequency waveform and start of the next.

- f

#90584

The codec2 with packet overheads will come to approximately 1500 bps. With the 300 baud limit, we are looking at 1500/300 = 5 bits per baud.?

We slice this many ways. The easiest way is to have five tones. One for each bit. Further, you can reduce it to half, or lets say 3 bits with each bit having four values at 0, 90, 180 and 270 degree phase shift from the previous phase, thus conveying two bits. Further, you can add two amplitude levels double from four to eight different? values, bringing the bits per tone to eight.

These are usually plotted on a phase/amplitude graph as a constellation.

These higher bits per baud accordingly need better cnr to resolve the exact phase or amplitude.

Wspr/jt set of modulation schemes limit themselves to a single frequency modulated tone. It achieves remarkable relience by averaging the carrier frequency over tens of cycles

- f

Show quoted text

On Mon, Oct 18, 2021, 10:13 PM Jerry Gaffke via <jgaffke=[email protected]> wrote:

That's a good description of MSK, but there is a second order discontinuity when it switches tones.? The slope of the curve makes an abrupt change there.? GMSK makes the transition more gradual, but still centers the transition on the zero crossing like MSK.
??
A modulation scheme such as Olivia can choose from dozens of different tones sent one at a time, not just two different tones.? With enough different tones to choose from, you can probably have enough bandwidth to meet the needs of Codec2 voice transmissions and still meet the US FCC's 300 baud max symbol rate.? But I'm guessing the total bandwidth occupied would not be much less than a two tone signal switched at a fast enough rate to give the same bandwidth.? Switching between many different tones can probably be done using the GMSK method, but all the basic descriptions of GMSK I've seen so far only tell us about the two tone version.

Most Codec2 transmissions on HF take it a step further, and send multiple tones simultaneously to get even more bandwidth out of that 300 baud max symbol rate.? These transmissions must go through a linear amplifier.

Jerry, KE7ER

On Mon, Oct 18, 2021 at 08:59 AM, Ashhar Farhan wrote:

Gmsk is really very simple.
First, it picks two frequencies that the carrier switches between. Second, the frequency is switched preciselt where the signal hits the zero voltage. Thus, it avoids the abrupt ending of one frequency waveform and start of the next.

- f

#90585

Yes, that all sounds right.?

You can do a lot with 1 carrier, but the more you do the more susceptible it is to noise.
I am curious how much more bandwidth 2 tone GMSK needs over something using?
the most sophisticated multiple tones, multiple phases, multiple amplitudes scheme
for equal performance in the presence of typical HF noise.?
I'd guess not much, but could be wrong.? Regardless, on HF we would be forced
to do that stuff if we must follow this absurd max symbol rate of 300 baud.

D-Star on VHF gets by with choosing between just two tones, has a bandwidth of 6khz
for fair voice quality.? Bitrate is higher than what Codec2 can do,
Codec2 should get it down to 3 or 4 khz for the same voice quality.

At 1500 bps, I'd guess Codec2 is just barely intelligible,
everybody sounding like a 50 year old TV robot.

Jerry, KE7ER

Show quoted text

On Mon, Oct 18, 2021 at 10:58 AM, Ashhar Farhan wrote:

The codec2 with packet overheads will come to approximately 1500 bps. With the 300 baud limit, we are looking at 1500/300 = 5 bits per baud.?
We slice this many ways. The easiest way is to have five tones. One for each bit. Further, you can reduce it to half, or lets say 3 bits with each bit having four values at 0, 90, 180 and 270 degree phase shift from the previous phase, thus conveying two bits. Further, you can add two amplitude levels double from four to eight different? values, bringing the bits per tone to eight.

These are usually plotted on a phase/amplitude graph as a constellation.

These higher bits per baud accordingly need better cnr to resolve the exact phase or amplitude.

Wspr/jt set of modulation schemes limit themselves to a single frequency modulated tone. It achieves remarkable relience by averaging the carrier frequency over tens of cycles

- f

#90587

On Mon, Oct 18, 2021 at 12:59 PM, Jerry Gaffke wrote:

max symbol rate of 300 baud.

The regulations are as clear as mud, but one interpretation may be that is perfectly OK to transmit digital voice at baud rates high enough for digital voice. And doing so with GMSK as well as other digital modulation modes.

Just as long as you transmit digital in the voice part of the HF bands. If you want to transmit digital anything in the parts of the HF bands reserved for RITTY/Data then the 300 baud limitation come into play. Interesting, they don't even mention CW anymore. It is all data now.?

"(2) No non-phone emission shall exceed the bandwidth of a communications quality phone emission of the same modulation type. The total bandwidth of an independent sideband emission (having B as the first symbol), or a multiplexed image and phone emission, shall not exceed that of a communications quality A3E emission."

Above paragraph cut from this document:

Tom, wb6b

#90588

With 1.6 kbit/s you get very good voice with state-of-the-art AI-based
voice encoders (eg. LPCNet or SSMGAN). Please take some time to hear:


And try also with FreeDV 2020 (with LPCNet).

1.5kbit/s with audio barely intelligible was 25 years ago with some
second generation vocoders... which were indeed pretty good for the
time. Codec2 in standard version go down to 700 bps, and down to 450 bps
with data-driven reconstruction techniques.

Cheers,
Rafael

Show quoted text

On 10/18/21 10:59 PM, Jerry Gaffke via groups.io wrote:
Yes, that all sounds right.?

You can do a lot with 1 carrier, but the more you do the more
susceptible it is to noise.
I am curious how much more bandwidth 2 tone GMSK needs over something
using?
the most sophisticated multiple tones, multiple phases, multiple
amplitudes scheme
for equal performance in the presence of typical HF noise.?
I'd guess not much, but could be wrong.? Regardless, on HF we would be
forced
to do that stuff if we must follow this absurd max symbol rate of 300
baud.

D-Star on VHF gets by with choosing between just two tones, has a
bandwidth of 6khz
for fair voice quality.? Bitrate is higher than what Codec2 can do,
Codec2 should get it down to 3 or 4 khz for the same voice quality.

At 1500 bps, I'd guess Codec2 is just barely intelligible,
everybody sounding like a 50 year old TV robot.

Jerry, KE7ER

On Mon, Oct 18, 2021 at 10:58 AM, Ashhar Farhan wrote:

    The codec2 with packet overheads will come to approximately 1500
    bps. With the 300 baud limit, we are looking at 1500/300 = 5 bits
    per baud.?
    We slice this many ways. The easiest way is to have five tones.
    One for each bit. Further, you can reduce it to half, or lets say
    3 bits with each bit having four values at 0, 90, 180 and 270
    degree phase shift from the previous phase, thus conveying two
    bits. Further, you can add two amplitude levels double from four
    to eight different? values, bringing the bits per tone to eight.
    These are usually plotted on a phase/amplitude graph as a
    constellation.
    These higher bits per baud accordingly need better cnr to resolve
    the exact phase or amplitude.
    Wspr/jt set of modulation schemes limit themselves to a single
    frequency modulated tone. It achieves remarkable relience by
    averaging the carrier frequency over tens of cycles
    - f

OpenPGP_signature

#90590

On Tue, Oct 19, 2021 at 04:34 AM, Rafael Diniz wrote:

With 1.6 kbit/s you get very good voice with state-of-the-art AI-based
voice encoders

Hi Rafael,

I see you have an interesting paper covering some experiments with the new trained neural net voice encoders/encoders used on HF radio links.

For those interested, the pdf can be uploaded to google translate and it will spit back a translated version.?

Tom, wb6b

#90592

Wow!
Things have advanced considerably.
Those 1.6 kbps voice samples were better than I expected.
At 5 GMAC/s, the computational load is significant but within easy reach.

Jerry, KE7ER

Show quoted text

On Tue, Oct 19, 2021 at 04:34 AM, Rafael Diniz wrote:

With 1.6 kbit/s you get very good voice with state-of-the-art AI-based
voice encoders (eg. LPCNet or SSMGAN). Please take some time to hear:

And try also with FreeDV 2020 (with LPCNet).

1.5kbit/s with audio barely intelligible was 25 years ago with some
second generation vocoders... which were indeed pretty good for the
time. Codec2 in standard version go down to 700 bps, and down to 450 bps
with data-driven reconstruction techniques.

Cheers,
Rafael

#90596

Hi Tom,

I'm preparing a new paper in English to send for a conference soon. I
let you know!
; )

Also, for images, HIFIC, for example, provides significant better
compression than traditional non-AI based image encoders.

Rafael
PU2UIT

Show quoted text

On 10/19/21 3:49 PM, Tom, wb6b wrote:
On Tue, Oct 19, 2021 at 04:34 AM, Rafael Diniz wrote:

    With 1.6 kbit/s you get very good voice with state-of-the-art AI-based
    voice encoders

Hi Rafael,

I see you have an interesting paper covering some experiments with the
new trained neural net voice encoders/encoders used on HF radio links.
'adio_de_longa_distancia_utilizando_a_banda_de_HF
<'adio_de_longa_distancia_utilizando_a_banda_de_HF>

For those interested, the pdf can be uploaded to google translate and
it will spit back a translated version.?

Tom, wb6b

OpenPGP_signature

#90606

On Wed, Oct 20, 2021 at 03:24 AM, Rafael Diniz wrote:

Also, for images, HIFIC, for example, provides significant better
compression than traditional non-AI based image encoders.

Hi Rafael,

That is really cool. I built an interface to TensorFlow many years ago and had fun feeding images to a early neural net that was all the rage at time. Haven't done much with AI in recent time. Looks like some great strides have been made in real world applications.

Because you are involved currently in AI and Neural Nets, I have a couple of random questions.

1) Regarding the AI image compression. Is it possible that there would be adversarial patterns/images that could produce completely unexpected results. An example was a university study where people taped a piece of paper to their chest with a random looking patterns printed on them. It fooled the AI into thinking the pattern was a face and it spit out a "match" Was some time ago so I don't remember exactly what the result was.

2) Something I've toying around with and trying to find any possible related research. That is, would it be possible to train an AI (possibility using the same training set as used for the speech compression) to listen to the "Donald Duck" sounds of a SSB transmission received by an AM receiver, to produce fully intelligible speech? Reading about the AI speech compressors and reading about the training data sets that are available, may make it possible for me to give it a try.

I wonder how much it is going to drain my bank account setting up an Amazon AWS server with GPUs and run the training?

Look forward to your paper.

Tom, wb6b

#90608

Rafael,

Is a CW decoder possible with band waterfall in its input? I don't know how to start with tensorflow. It would be dandy if someone can write? a simple HelloWorld for us old timers to start using TensorFlow.

- f

Show quoted text

On Thu, Oct 21, 2021, 5:09 AM Tom, wb6b <wb6b@...> wrote:

On Wed, Oct 20, 2021 at 03:24 AM, Rafael Diniz wrote:

Also, for images, HIFIC, for example, provides significant better
compression than traditional non-AI based image encoders.
Hi Rafael,

That is really cool. I built an interface to TensorFlow many years ago and had fun feeding images to a early neural net that was all the rage at time. Haven't done much with AI in recent time. Looks like some great strides have been made in real world applications.

Because you are involved currently in AI and Neural Nets, I have a couple of random questions.

1) Regarding the AI image compression. Is it possible that there would be adversarial patterns/images that could produce completely unexpected results. An example was a university study where people taped a piece of paper to their chest with a random looking patterns printed on them. It fooled the AI into thinking the pattern was a face and it spit out a "match" Was some time ago so I don't remember exactly what the result was.

2) Something I've toying around with and trying to find any possible related research. That is, would it be possible to train an AI (possibility using the same training set as used for the speech compression) to listen to the "Donald Duck" sounds of a SSB transmission received by an AM receiver, to produce fully intelligible speech? Reading about the AI speech compressors and reading about the training data sets that are available, may make it possible for me to give it a try.

I wonder how much it is going to drain my bank account setting up an Amazon AWS server with GPUs and run the training?

Look forward to your paper.

Tom, wb6b

#90612

On Wed, Oct 20, 2021 at 05:10 PM, Ashhar Farhan wrote:

It would be dandy if someone can write? a simple HelloWorld for us old timers to start using TensorFlow.

Here is what Amazon offers for ML (AI) development.

This is Amazon's getting started page.

However there are other prebuilt examples that can be downloaded trained and run for practice. If I find anything interesting in my notes from when I discovered this Amazon service existed I can add some links. I seem to remember a basic step by step training course that let you work through examples, starting from very "simple" and let you progressively work your way up to more complex things.?

There is a big learning curve and you need to get familiar with AWS. I know AWS but only at the beginning of the learning curve of the sagemaker service.

I started to play a bit here but became involved with other priorities. The AI based speech compression that came up in these posts and the thoughts of things that might be possible by adapting that work to other uses is restoring my interest.?

Tom, wb6b

#90616

This maybe an interesting quick read for those interested in AI/ML in the embedded space.

Article has links to a series of training videos.?

There is also a NXP Connects coming up soon.
EMEA Nov 9-10 | Americas Nov 10-11 | Asia-Pacific Nov 16-17
https://www.nxp.com/company/about-nxp/events/nxp-connects:NXP-CONNECTS

Digikey, Arrow, Mouser are sponsors.
(in the past,? NXP sold some of the boards used in the training sessions through supplier channels at discounted rates)

Rgds,
Gary

#90618

Hi Tom,

1) Regarding the AI image compression. Is it possible that there would
be adversarial patterns/images that could produce completely
unexpected results. An example was a university study where people
taped a piece of paper to their chest with a random looking patterns
printed on them. It fooled the AI into thinking the pattern was a face
and it spit out a "match" Was some time ago so I don't remember
exactly what the result was.I never got such strange behavior with HIFIC, but it is a matter of us
trying out. Indeed, the performance is not orders of magnitude better
than VVC (H.266) - by reading the papers. But in my experiences, with
the lower bitrate HIFIC (0.160 bpp) is better than state-of-the-art
traditional image encoding (aka. VVC). Indeed, images look much better
(do not look the PSNR thou). Use the? "hific-lo" model in (I'm using the
CPU version in my old Lenovo T430, and for still images, you just need
at least 16 GB ram to play with and get the results after less than 30s!):



2) Something I've toying around with and trying to find any possible
related research. That is, would it be possible to train an AI
(possibility using the same training set as used for the speech
compression) to listen to the "Donald Duck" sounds of a SSB
transmission received by an AM receiver, to produce fully intelligible
speech? Reading about the AI speech compressors and reading about the
training data sets that are available, may make it possible for me to
give it a try.Certainly yes. An AI NN trained for such purpose could reconstruct a
good quality voice from SSB quality "Donald Duck" analog voice. Noise is
the most challenging thing here as far as I played with.

I wonder how much it is going to drain my bank account setting up an
Amazon AWS server with GPUs and run the training?:P
In the university we have some nice GPUs to play with. If you really
want to play with this, I bet nothing, as there will be many people
offering you GPU access to play with such computer science edge stuff.

Look forward to your paper.Yay!

Cheers,
Rafael

OpenPGP_signature

#90619

Hi Farhan,

Do you have any "traditional" automatic CW decoder reference for
performance comparison? I hope to get something better than what exists,
otherwise it does not make any sense.

Rafael

Show quoted text

On 10/21/21 3:10 AM, Ashhar Farhan wrote:
Rafael,
Is a CW decoder possible with band waterfall in its input? I don't
know how to start with tensorflow. It would be dandy if someone can
write? a simple HelloWorld for us old timers to start using TensorFlow.
- f

On Thu, Oct 21, 2021, 5:09 AM Tom, wb6b <wb6b@...
<mailto:wb6b@...>> wrote:

    On Wed, Oct 20, 2021 at 03:24 AM, Rafael Diniz wrote:

        Also, for images, HIFIC, for example, provides significant better
        compression than traditional non-AI based image encoders.

    Hi Rafael,

    That is really cool. I built an interface to TensorFlow many years
    ago and had fun feeding images to a early neural net that was all
    the rage at time. Haven't done much with AI in recent time. Looks
    like some great strides have been made in real world applications.

    Because you are involved currently in AI and Neural Nets, I have a
    couple of random questions.

    1) Regarding the AI image compression. Is it possible that there
    would be adversarial patterns/images that could produce completely
    unexpected results. An example was a university study where people
    taped a piece of paper to their chest with a random looking
    patterns printed on them. It fooled the AI into thinking the
    pattern was a face and it spit out a "match" Was some time ago so
    I don't remember exactly what the result was.

    2) Something I've toying around with and trying to find any
    possible related research. That is, would it be possible to train
    an AI (possibility using the same training set as used for the
    speech compression) to listen to the "Donald Duck" sounds of a SSB
    transmission received by an AM receiver, to produce fully
    intelligible speech? Reading about the AI speech compressors and
    reading about the training data sets that are available, may make
    it possible for me to give it a try.

    I wonder how much it is going to drain my bank account setting up
    an Amazon AWS server with GPUs and run the training?

    Look forward to your paper.

    Tom, wb6b

OpenPGP_signature

#90620

Rafael,

?

This Morse decoder is really good ==>

and it is open source.

I didn't look at the code, as I was just browsing for a?morse decoder a few days ago, but it works really well!

73,

Rafael

开云体育

HF Digital Voice Modulation modes