What also cannot be ignored, is that transformer models are a great unifying force. It's basically one architecture that can be used for many purposes.
This eliminates the need for more specialized models and the associated engineering and optimizations for their infrastructure needs.
This eliminates the need for more specialized models and the associated engineering and optimizations for their infrastructure needs.