Keyboard Shortcuts
ctrl + shift + ? :
Show all keyboard shortcuts
ctrl + g :
Navigate to a group
ctrl + shift + f :
Find
ctrl + / :
Quick actions
esc to dismiss
Likes
Search
HF Digital Voice Modulation modes
Sent from ProtonMail mobile -------- Original Message -------- On Oct 18, 2021, 2:49 AM, Tom, wb6b < wb6b@...> wrote:
|
On Mon, Oct 18, 2021 at 05:18 AM, Evan Hand wrote:
I thought that FreeDV was a multiple carrier mode and would require a linear amp.I was proposing to use the FreeDV codec but use the GMSK modulation type. Or another FSK type it it is better.? I don't think there's any reason not to mix and match codecs (voice encoders) and modulation schemes. Tom, wb6b |
As I understand it, GMSK as used by D-Star is just two tones.??
toggle quoted message
Show quoted text
In GMSK, the transition from one tone to the other is made as soft as possible to reduce the bandwidth of the resultant signal, the key click filter on a? CW transmitter does something similar. Looking around the web, I don't see much talk about D-Star modulation for HF. My guess is that anybody using D-Star on HF just uses the same modulation scheme as on VHF, ignoring the FCC's 300 baud max symbol rate.? ? Does anybody know for sure? If AM phone is still legal, I suppose they would argue that a 6khz wide D-Star transmission should be too.? Or perhaps I am wrong about the 300 baud max. Jerry, KE7ER On Mon, Oct 18, 2021 at 05:38 AM, Tom, wb6b wrote: On Mon, Oct 18, 2021 at 05:18 AM, Evan Hand wrote: |
Gmsk is really very simple. First, it picks two frequencies that the carrier switches between. Second, the frequency is switched preciselt where the signal hits the zero voltage. Thus, it avoids the abrupt ending of one frequency waveform and start of the next. - f On Mon, Oct 18, 2021, 9:15 PM Jerry Gaffke via <jgaffke=[email protected]> wrote: As I understand it, GMSK as used by D-Star is just two tones.?? |
That's a good description of MSK, but there is a second order discontinuity when it switches tones.? The slope of the curve makes an abrupt change there.? GMSK makes the transition more gradual, but still centers the transition on the zero crossing like MSK.
toggle quoted message
Show quoted text
?? A modulation scheme such as Olivia can choose from dozens of different tones sent one at a time, not just two different tones.? With enough different tones to choose from, you can probably have enough bandwidth to meet the needs of Codec2 voice transmissions and still meet the US FCC's 300 baud max symbol rate.? But I'm guessing the total bandwidth occupied would not be much less than a two tone signal switched at a fast enough rate to give the same bandwidth.? Switching between many different tones can probably be done using the GMSK method, but all the basic descriptions of GMSK I've seen so far only tell us about the two tone version. Most Codec2 transmissions on HF take it a step further, and send multiple tones simultaneously to get even more bandwidth out of that 300 baud max symbol rate.? These transmissions must go through a linear amplifier. Jerry, KE7ER On Mon, Oct 18, 2021 at 08:59 AM, Ashhar Farhan wrote: Gmsk is really very simple. |
The codec2 with packet overheads will come to approximately 1500 bps. With the 300 baud limit, we are looking at 1500/300 = 5 bits per baud.? We slice this many ways. The easiest way is to have five tones. One for each bit. Further, you can reduce it to half, or lets say 3 bits with each bit having four values at 0, 90, 180 and 270 degree phase shift from the previous phase, thus conveying two bits. Further, you can add two amplitude levels double from four to eight different? values, bringing the bits per tone to eight. These are usually plotted on a phase/amplitude graph as a constellation. These higher bits per baud accordingly need better cnr to resolve the exact phase or amplitude. Wspr/jt set of modulation schemes limit themselves to a single frequency modulated tone. It achieves remarkable relience by averaging the carrier frequency over tens of cycles - f On Mon, Oct 18, 2021, 10:13 PM Jerry Gaffke via <jgaffke=[email protected]> wrote: That's a good description of MSK, but there is a second order discontinuity when it switches tones.? The slope of the curve makes an abrupt change there.? GMSK makes the transition more gradual, but still centers the transition on the zero crossing like MSK. |
Yes, that all sounds right.?
toggle quoted message
Show quoted text
You can do a lot with 1 carrier, but the more you do the more susceptible it is to noise. I am curious how much more bandwidth 2 tone GMSK needs over something using? the most sophisticated multiple tones, multiple phases, multiple amplitudes scheme for equal performance in the presence of typical HF noise.? I'd guess not much, but could be wrong.? Regardless, on HF we would be forced to do that stuff if we must follow this absurd max symbol rate of 300 baud. D-Star on VHF gets by with choosing between just two tones, has a bandwidth of 6khz for fair voice quality.? Bitrate is higher than what Codec2 can do, Codec2 should get it down to 3 or 4 khz for the same voice quality. At 1500 bps, I'd guess Codec2 is just barely intelligible, everybody sounding like a 50 year old TV robot. Jerry, KE7ER On Mon, Oct 18, 2021 at 10:58 AM, Ashhar Farhan wrote: The codec2 with packet overheads will come to approximately 1500 bps. With the 300 baud limit, we are looking at 1500/300 = 5 bits per baud.? |
On Mon, Oct 18, 2021 at 12:59 PM, Jerry Gaffke wrote:
max symbol rate of 300 baud.The regulations are as clear as mud, but one interpretation may be that is perfectly OK to transmit digital voice at baud rates high enough for digital voice. And doing so with GMSK as well as other digital modulation modes. Just as long as you transmit digital in the voice part of the HF bands. If you want to transmit digital anything in the parts of the HF bands reserved for RITTY/Data then the 300 baud limitation come into play. Interesting, they don't even mention CW anymore. It is all data now.? "(2) No non-phone emission shall exceed the bandwidth of a communications quality phone emission of the same modulation type. The total bandwidth of an independent sideband emission (having B as the first symbol), or a multiplexed image and phone emission, shall not exceed that of a communications quality A3E emission."
Above paragraph cut from this document: Tom, wb6b |
With 1.6 kbit/s you get very good voice with state-of-the-art AI-based
toggle quoted message
Show quoted text
voice encoders (eg. LPCNet or SSMGAN). Please take some time to hear: And try also with FreeDV 2020 (with LPCNet). 1.5kbit/s with audio barely intelligible was 25 years ago with some second generation vocoders... which were indeed pretty good for the time. Codec2 in standard version go down to 700 bps, and down to 450 bps with data-driven reconstruction techniques. Cheers, Rafael On 10/18/21 10:59 PM, Jerry Gaffke via groups.io wrote:
Yes, that all sounds right.? |
On Tue, Oct 19, 2021 at 04:34 AM, Rafael Diniz wrote:
With 1.6 kbit/s you get very good voice with state-of-the-art AI-basedHi Rafael, I see you have an interesting paper covering some experiments with the new trained neural net voice encoders/encoders used on HF radio links. For those interested, the pdf can be uploaded to google translate and it will spit back a translated version.? Tom, wb6b |
Wow!
toggle quoted message
Show quoted text
Things have advanced considerably. Those 1.6 kbps voice samples were better than I expected. At 5 GMAC/s, the computational load is significant but within easy reach. Jerry, KE7ER On Tue, Oct 19, 2021 at 04:34 AM, Rafael Diniz wrote:
With 1.6 kbit/s you get very good voice with state-of-the-art AI-based |
Hi Tom,
toggle quoted message
Show quoted text
I'm preparing a new paper in English to send for a conference soon. I let you know! ; ) Also, for images, HIFIC, for example, provides significant better compression than traditional non-AI based image encoders. Rafael PU2UIT On 10/19/21 3:49 PM, Tom, wb6b wrote:
On Tue, Oct 19, 2021 at 04:34 AM, Rafael Diniz wrote: |
On Wed, Oct 20, 2021 at 03:24 AM, Rafael Diniz wrote:
Also, for images, HIFIC, for example, provides significant betterHi Rafael, That is really cool. I built an interface to TensorFlow many years ago and had fun feeding images to a early neural net that was all the rage at time. Haven't done much with AI in recent time. Looks like some great strides have been made in real world applications. Because you are involved currently in AI and Neural Nets, I have a couple of random questions. 1) Regarding the AI image compression. Is it possible that there would be adversarial patterns/images that could produce completely unexpected results. An example was a university study where people taped a piece of paper to their chest with a random looking patterns printed on them. It fooled the AI into thinking the pattern was a face and it spit out a "match" Was some time ago so I don't remember exactly what the result was. 2) Something I've toying around with and trying to find any possible related research. That is, would it be possible to train an AI (possibility using the same training set as used for the speech compression) to listen to the "Donald Duck" sounds of a SSB transmission received by an AM receiver, to produce fully intelligible speech? Reading about the AI speech compressors and reading about the training data sets that are available, may make it possible for me to give it a try. I wonder how much it is going to drain my bank account setting up an Amazon AWS server with GPUs and run the training? Look forward to your paper. Tom, wb6b |
Rafael, Is a CW decoder possible with band waterfall in its input? I don't know how to start with tensorflow. It would be dandy if someone can write? a simple HelloWorld for us old timers to start using TensorFlow. - f On Thu, Oct 21, 2021, 5:09 AM Tom, wb6b <wb6b@...> wrote: On Wed, Oct 20, 2021 at 03:24 AM, Rafael Diniz wrote: |
On Wed, Oct 20, 2021 at 05:10 PM, Ashhar Farhan wrote:
It would be dandy if someone can write? a simple HelloWorld for us old timers to start using TensorFlow.Here is what Amazon offers for ML (AI) development. This is Amazon's getting started page. However there are other prebuilt examples that can be downloaded trained and run for practice. If I find anything interesting in my notes from when I discovered this Amazon service existed I can add some links. I seem to remember a basic step by step training course that let you work through examples, starting from very "simple" and let you progressively work your way up to more complex things.? There is a big learning curve and you need to get familiar with AWS. I know AWS but only at the beginning of the learning curve of the sagemaker service. I started to play a bit here but became involved with other priorities. The AI based speech compression that came up in these posts and the thoughts of things that might be possible by adapting that work to other uses is restoring my interest.? Tom, wb6b |
This maybe an interesting quick read for those interested in AI/ML in the embedded space.
Article has links to a series of training videos.? There is also a NXP Connects coming up soon. EMEA Nov 9-10 | Americas Nov 10-11 | Asia-Pacific Nov 16-17 https://www.nxp.com/company/about-nxp/events/nxp-connects:NXP-CONNECTS Digikey, Arrow, Mouser are sponsors. (in the past,? NXP sold some of the boards used in the training sessions through supplier channels at discounted rates) Rgds, Gary |
Hi Tom,
1) Regarding the AI image compression. Is it possible that there wouldI never got such strange behavior with HIFIC, but it is a matter of us trying out. Indeed, the performance is not orders of magnitude better than VVC (H.266) - by reading the papers. But in my experiences, with the lower bitrate HIFIC (0.160 bpp) is better than state-of-the-art traditional image encoding (aka. VVC). Indeed, images look much better (do not look the PSNR thou). Use the? "hific-lo" model in (I'm using the CPU version in my old Lenovo T430, and for still images, you just need at least 16 GB ram to play with and get the results after less than 30s!): 2) Something I've toying around with and trying to find any possibleCertainly yes. An AI NN trained for such purpose could reconstruct a good quality voice from SSB quality "Donald Duck" analog voice. Noise is the most challenging thing here as far as I played with. I wonder how much it is going to drain my bank account setting up an:P In the university we have some nice GPUs to play with. If you really want to play with this, I bet nothing, as there will be many people offering you GPU access to play with such computer science edge stuff. Look forward to your paper.Yay! Cheers, Rafael |
Hi Farhan,
toggle quoted message
Show quoted text
Do you have any "traditional" automatic CW decoder reference for performance comparison? I hope to get something better than what exists, otherwise it does not make any sense. Rafael On 10/21/21 3:10 AM, Ashhar Farhan wrote:
Rafael, |
to navigate to use esc to dismiss