This library bridges the high-performance Metal Flash Attention implementation to other programming languages through a clean C API. It maintains zero-copy semantics by working directly with Metal ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results