7 comments

  • codingdave 22 minutes ago
    > If you haven’t spent at least $1,000 on tokens today per human engineer, your software factory has room for improvement

    At that point, outside of FAANG and their salaries, you are spending more on AI than you are on your humans. And they consider that level of spend to be a metric in and of itself. I'm kinda shocked the rest of the article just glossed over that one. It seems to be a breakdown of the entire vision of AI-driven coding. I mean, sure, the vendors would love it if everyone's salary budget just got shifted over to their revenue, but such a world is absolutely not my goal.

    • philipp-gayret 3 minutes ago
      $1,000 is maybe 5$ per workday. I measure my own usage and am on the way to $6,000 for a full year. I'm still at the stage where I like to look at the code I produce, but I do believe we'll head to a state of software development where one day we won't need to.
      • dixie_land 12 minutes ago
        This is an interesting point but if I may offer a different perspective:

        Assuming 20 working days a month: that's 20k x 12 == 240k a year. So about a fresh grad's TC at FANG.

        Now I've worked with many junior to mid-junior level SDEs and sadly 80% does not do a better job than Claude. (I've also worked with staff level SDEs who writes worse code than AI, but they offset that usually with domain knowledge and TL responsibilities)

        I do see AI transform software engineering into even more of a pyramid with very few human on top.

        • bobbiechen 7 minutes ago
          Important too, a fully loaded salary costs the company far more than the actual salary that the employee receives. That would tip this balancing point towards 120k salaries, which is well into the realm of non-FAANG
        • dewey 18 minutes ago
          It would depend on the speed of execution, if you can do the same amount of work in 5 days with spending 5k, vs spending a month and 5k on a human the math makes more sense.
          • iBelieve 8 minutes ago
            By today, do they mean _per day_, or just _total to date_ per engineer?
            • kaffekaka 18 minutes ago
              If the output is (dis)proportionally larger, the cost trade off might be the right thing to do.

              And it might be the tokens will become cheaper.

            • wrs 5 minutes ago
              On the cxdb “product” page one reason they give against rolling your own is that it would be “months of work”. Slipped into an archaic off-brand mindset there, no?
              • japhyr 30 minutes ago
                > That idea of treating scenarios as holdout sets—used to evaluate the software but not stored where the coding agents can see them—is fascinating. It imitates aggressive testing by an external QA team—an expensive but highly effective way of ensuring quality in traditional software.

                This is one of the clearest takes I've seen that starts to get me to the point of possibly being able to trust code that I haven't reviewed.

                The whole idea of letting an AI write tests was problematic because they're so focused on "success" that `assert True` becomes appealing. But orchestrating teams of agents that are incentivized to build, and teams of agents that are incentivized to find bugs and problematic tests, is fascinating.

                I'm quite curious to see where this goes, and more motivated (and curious) than ever to start setting up my own agents.

                Question for people who are already doing this: How much are you spending on tokens?

                That line about spending $1,000 on tokens is pretty off-putting. For commercial teams it's an easy calculation. It's also depressing to think about what this means for open source. I sure can't afford to spend $1,000 supporting teams of agents to continue my open source work.

                • CubsFan1060 14 minutes ago
                  I can't tell if this is genius or terrifying given what their software does. Probably a bit of both.

                  I wonder what the security teams at companies that use StrongDM will think about this.

                  • g947o 14 minutes ago
                    Serious question: what's keeping a competitor from doing the same thing and doing it better than you?
                    • simonw 8 minutes ago
                      That's a genuine problem now. If you launch a new feature and your competition can ship their own copy a few hours later the competitive dynamics get really challenging!

                      My hunch is that the thing that's going to matter is network effects and other forms of soft lockin. Features alone won't cut it - you need to build something where value accumulates to your user over time in a way that discourages them from leaving.

                      • CubsFan1060 2 minutes ago
                        The interesting part about that is both of those things require some sort of time to start.

                        If I launch a new product, and 4 hours later competitors pop up, then there's not enough time for network effects or lockin.

                        I'm guessing what is really going to be needed is something that can't be just copied. Non-public data, business contracts, something outside of software.

                    • rhrthg 34 minutes ago
                      Can you disclose the number of Substack subscriptions and whether there is an unusual amount of bulk subscriptions from certain entities?