SoLA: Leveraging Soft Activation Sparsity and Low-Rank Decomposition for Large Language Model Compression | Frontier Pulse