Oh man I was playing with Mycroft and Mozzilla's Deepspeach back in the day just for this. Though honestly a free desktop supported API that apps could integrate still seems like the best way for this. The next one would be getting Voice User Interface (VUI) support into major frameworks so it's just native to apps built with major frame works. The latter makes more sense AFTER the desktop API starts getting standardized.