Making Wolfram Tech Available as a Foundation Tool for LLM Systems

(writings.stephenwolfram.com)

78 points | by surprisetalk 5 hours ago

8 comments

  • Davidzheng 12 minutes ago
    There's a lot of value in the implementation of many strong and fast algeorithms in computer algebra in proprietary tools such as Maple, Wolfram, Matlab. However, I (though of course believe that such work needs to be compensated) find it against the spirit of science to keep them from the general public. I think it would be good service to use AI tools to bring open source alternatives like sympy and sage and macaulay to par. There's really A LOT of cool algorithms missing (most familiar to me are some in computational algebraic geometry)

    Additionally I think because of how esoteric some algorithms are, they are not always implemented in the most efficient way for today's computers. It would be really nice to have better software written by strong software engineers who also understands the maths for mathematicians. I hope to see an application of AI here to bring more SoTA tools to mathematicians--I think it is much more value than formalization brings to be completely honest.

  • nphardon 2 hours ago
    There's a great discussion with Stephen Wolfram on the Sean Carroll podcast. Listening to it made me think very highly of Wolfram. He's a free thinking, eccentric, mathematician, scientist; who got started doing serious work at a very young age. He still has a youthful creative approach to thought and science. I hope LLMs do pair well with his tools.
    • lioeters 36 minutes ago
      To save others a search, here's the podcast with Wolfram.

      Stephen Wolfram on Computation, Hypergraphs, and Fundamental Physics - https://podbay.fm/p/sean-carrolls-mindscape-science-society-... (2hr 40min)

      I'm a fan of his work and person too. Not a fanatic or evangelical level, but I do think he's one of the more historically relevant computer scientists and philosophers working today. I can overlook his occasional arrogance, and recognize that there's a genuine and original thinker who's been pursuing truth and knowledge for decades.

    • kylecazar 1 hour ago
      He live streams the (internal) Wolfram Alpha product meetings on YouTube. It's really interesting to watch, I've been a fly on the wall for years.
      • lcdryuga1983 1 minute ago
        I tried finding this but couldn't find them on youtube. Can you please share the link for one of the videos?
      • nphardon 1 hour ago
        I knew about this but never attended, so cool!
    • jadbox 2 hours ago
      I'm fairly certain Stephen Wolfram will be one of the few intellectuals today that will still be remembered in 50 years.
  • ddp26 1 hour ago
    I tried using wolfram alpha as a tool for an llm research agent, and I couldn't find any tasks it could solve with it, that it couldn't solve with just Google and Python.
    • nradov 26 minutes ago
      Well sure, in theory any mathematical problem can be solved with any Turing complete programming language. I think the idea here is that for certain problem domains Mathematica might be more efficient or easier for humans to understand than Python.
  • qrios 1 hour ago
    A simple skill markdown for Claude Code was enough to use the local Wolfram Kernel.

    Even the documentation search is available:

    ```bash

    /Applications/Wolfram.app/Contents/MacOS/WolframKernel -noprompt -run '

    Needs["DocumentationSearch`"];

    result = SearchDocumentation["query term"];

    Print[Column[Take[result, UpTo[10]]]];

    Exit[]'

    ```

  • skolos 3 hours ago
    I like Mathematica and use it regularly. But I did not see any benefits of using it over python as a tool that Claude Code can use. Every script it produced in wolfram was slower with worse answers than python. Wolfram people are really trying but so far the results are not very good.
    • mr_mitm 2 hours ago
      Back when I was using it, mathematica was unmatched in its ability to find integrals. Has python caught up there?
      • currymj 2 hours ago
        sympy is good enough for typical uses. the user interface is worse but that doesn't matter to Claude. I imagine if you have some really weird symbolic or numeric integrals, Mathematica may have some highly sophisticated algorithms where it would have an edge.

        however, even this advantage is eaten away somewhat because the models themselves are decent at solving hard integrals.

        • tptacek 2 hours ago
          I've always sort of assumed the models were just making sympy scripts behind the scenes.
          • currymj 1 hour ago
            sometimes you can see them do this and sometimes you can see they just work through the problem in the reasoning tokens without invoking python.
          • cyanydeez 2 hours ago
            Wheres Godel when you need him. A lot of this stuff is symbol shunting, which LLMs should be really good at.
    • ai-christianson 2 hours ago
      What do you think the problem is?
      • owyn 2 hours ago
        I think the problem is just not enough training on that specific language because it's proprietary. Most useful Mathematica code is on someone's personal computer, not GitHub. They can build up a useful set of training data, some benchmarks, a contest for the AI companies to score high on, because they do love that kind of thing.

        But for most internet applications (as opposed to "math" stuff) I would think Python is still a better language choice.

  • maxdo 1 hour ago
    CAG sounds like fake solution for LLM's. Math problems are not custom data, they are limited in amount, and do not refresh like product manuals.

    Hence math can always be part either generic llm or math fine tuned llm, without weird layer made for human ( entire wolfram) and dependencies.

    Wolfram alpha was always an extra translation layer between machine and human. LLM's are a universal translation layer that can also solve problems, verify etc.

  • petcat 3 hours ago
    Sounds cool.

    Aside, I hate the fact that I read posts like these and just subconsciously start counting the em-dashes and the "it's not just [thing], it's [other thing]" phrasing. It makes me think it's just more AI.

    • mr_mitm 3 hours ago
      If there is one person who likes to hear himself talk too much to use AI, it's got to be Stephen Wolfram.
      • jacquesm 2 hours ago
        It's like Stephen Wolfram, only now there is 10x more of it...
    • gnatman 3 hours ago
      If you go back to a random much older post you’ll find emdashes aplenty.

      e.g. https://writings.stephenwolfram.com/2014/07/launching-mathem...

      • _alaya 2 hours ago
        Plot twist - AI reasoned that Stephen Wolfram actually was the smartest human and thus chose to emulate his writing style.
    • porcoda 8 minutes ago
      The em-dash metric is silly. Some people (including me) have always used them and plan to continue to do so. I just pulled up some random articles by Wolfram from the before-LLM days and guess what: em-dashes everywhere. One sample from 2018 had 89 of them. Wolfram has always written in the same style (which, admittedly, can be a bit self-aggrandizing and verbose). It’s kinda weird to see people just blowing it off as AI slop just because of a —.
    • llbbdd 3 hours ago
      The other day I formatted a sentence out loud in the "it's not just x it's y" structure and immediately felt gross, despite having done it probably a million times in my lifetime. That was an out-of-body feeling.
      • nerevarthelame 28 minutes ago
        In George Orwell's essay "Politics and the English Language," [0] one of his primary recommendations for writing well is to "Never use a metaphor, simile, or other figure of speech which you are used to seeing in print."

        "It's not just X, it's Y" definitely seems to qualify today. It's a stale way to express an idea.

        I hadn't revisited that essay since LLMs became a thing, but boy was it prescient:

        > By using stale metaphors, similes, and idioms [and LLMs], you save much mental effort, at the cost of leaving your meaning vague, not only for your reader but for yourself ... But you are not obliged to go to all this trouble. You can shirk it by simply throwing your mind open and letting the ready-made phrases come crowding in. They will construct your sentences for you — even think your thoughts for you, to a certain extent — and at need they will perform the important service of partially concealing your meaning even from yourself.

        [0]: https://bioinfo.uib.es/~joemiro/RecEscr/PoliticsandEngLang.p...

      • zamadatix 2 hours ago
        When I notice that I change it to "it's y, not just x" just to catch others off guard :).
        • MillionOClock 1 hour ago
          Oh no! Now it's going to be in the training dataset :'(
    • sdeiley 2 hours ago
      There are dozens of us that used them before AI! Dozens!
    • nubg 1 hour ago
      Thank you from saving me a click and my brain from consuming AI slop by a person who cannot be bothered to use their own damn words.
    • scoot 3 hours ago
      LLMs use the em-dash excessively but correctly. This post is littered with them in places they don't belong which makes it look decidedly human, as if written by someone who believes that random em-dashes make their writing look more professional, while actually having the opposite effect.
      • metabagel 1 hour ago
        > This post is littered with them in places they don't belong

        Does he speak the same way - pausing for emphasis?

      • arjie 3 hours ago
        It's Stephen Wolfram, mathematician and computer scientist. This is how he portrays himself https://content.wolfram.com/sites/43/2019/02/07-popcorn-rig1...

        Somehow I don't think "trying to make my writing look professional" is very high on the priority list.

    • keybored 2 hours ago
      If you really want to know: more than one emmy-dash per paragraph is probably excessive.

      > LLMs don’t—and can’t—do everything. What they do is very impressive—and useful. It’s broad. And in many ways it’s human-like. But it’s not precise. And in the end it’s not about deep computation.

      This is a mess. What is the flow here? Two abrupt interrupts (and useful) followed by stubby sentences. Yucky.

      • written-beyond 2 hours ago
        Idk about the grammatical correctness of the punctuation, but I really enjoyed reading his writing. Never read something by him before, it was genuinely refreshing, specially given it was a glorified ad.
      • metabagel 1 hour ago
        It's a conversational writing style.
  • peter_d_sherman 1 hour ago
    >"But an approach that’s immediately and broadly applicable today—and for which we’re releasing several new products—is based on what we call

    computation-augmented generation, or CAG.

    The key idea of CAG is to inject in real time capabilities from our foundation tool into the stream of content that LLMs generate. In traditional retrieval-augmented generation, or RAG, one is injecting content that has been retrieved from existing documents.

    CAG is like an infinite extension of RAG

    , in which an infinite amount of content can be generated on the fly—using computation—to feed to an LLM."

    We welcome CAG -- to the list of LLM-related technologies!