lukax 2 hours ago

Is this Triton's reply to NVIDIA's tilus[1]. Tilus is suposed to be lower level (e.g. you have control over registers). NVIDIA really does not want the CUDA ecosystem to move to Triton as Triton also supports AMD and other accelerators. So with Gluon you get access to lower level features and you can stay within Triton ecosystem.

[1] https://github.com/NVIDIA/tilus

ericdotlee 39 minutes ago

Why is zog so popular these days? Seems really cool but I have yet to get the buzz / learn it.

Is there a big reason why Triton is considered a "failure"?

ronsor 3 hours ago

The fact that the "language" is still Python code which has to be traced in some way is a bit off-putting. It feels a bit hacky. I'd rather a separate compiler, honestly.

  • JonChesterfield 2 hours ago

    Mojo for python syntax without the ast walking decorator, cuda for c++ syntax over controlling the machine, ah hoc code generators writing mlir for data driven parametric approaches. The design space is filling out over time.

  • zer0zzz 10 minutes ago

    This is pretty common among these ml toolchain, and not a big deal. They use pythons ast lib and the function annotations to implement an ast walker and code generator. It works quite well.

  • derbOac 2 hours ago

    Yeah that struck me as odd. It's more like a Python library or something.

    • zer0zzz 4 minutes ago

      It’s a dsl not a library. The kernel launch parameters and the ast walk generate ir from the Python.