The Mill CPU

10 responses to “The Mill CPU”

Svilen Kanev says:

July 31, 2014 at 5:52 pm

Couldn’t agree more. The Itanium comparison does come to mind big time.
More philosophically, as many people dealing with architecture (myself included) have noticed, a lot of things look good on paper until you actually simulate the needy-greedy details.

LikeLike

Reply
Common Eggcorn says:

March 14, 2015 at 5:06 pm

> until you actually simulate the needy-greedy details.

ITYM “nitty gritty”? I’ve not seen that eggcorn before.

LikeLike

Reply
ian says:

March 14, 2015 at 8:40 pm

You bet it’s for real! And wide open for you to see. If you’d had any knowledge in the field you’d see the violent beauty of their design. Now you can’t see beyond the FUD you are spreading.

LikeLike

Reply
Hey says:

March 15, 2015 at 1:35 am

I’m pretty sure you mean nitty-gritty.

LikeLike

Reply
Paul Keeble says:

March 15, 2015 at 11:39 am

The CPU could be fabricated with an PGA of some description, it would be slower and potentially smaller in some way but prototype hardware shouldn’t be inaccessible to almost everyone. Once such a thing exists its possible to show the performance benefits with your basic compiler for the language and such. I do get in todays age that people want money before they have worked out how to make the thing they want to make and sell, but practically when it sounds like snake oil you need to do the moderate amount of heavy lifting yourself to prove your concept. That isn’t going to happen if they spend most of their time presenting it in conferences.

LikeLike

Reply
Ivan Godard says:

March 16, 2015 at 1:36 am

Your skepticism is completely justified. The Mill may never reach market – we are a startup, and most startups fail; its a fact of life. Although we’ve survived for over a decade, which is pretty good for startups these days.

But it sounds like you are less skeptical about Mill Computing the company, but more about Mill the technology and architecture. There are fewer ground to doubt that. As fast as we have been able to get the patents filed (I seem to have been doing nothing else for the last two years. I *hate* patents) we have been completely opening the kimono and showing the technical community, in detail, how each part works. Why? because we wanted outside validation before wasting another decade in something that was fatally flawed in some way we had overlooked.

If there was any part of the public Mill that one could point at and say “See? that won’t work, because …” then the web would have been all over us. Buy you know? Skepticism we get, aplenty. What we don’t get is *informed* skepticism. In fact, the more senior and skilled the commenter, the more they fall in love with the design. Like Andy Glew said one time (and if you don’t know who that is then you are not in the CPU business) – “Yeah, it’ll work, just the way he says it will”.

Sometimes people complain that our presentations are insufficiently detailed to fairly evaluate. Guilty as charged; they are oriented for a high level audience interested in the subject, but not for the specialist. However, if you ask for details on our forum (mill computing.com/forum/themill) or the comp.arch newsgroup, as hundreds have, you will get all the details you want until they flood out your ears and collect in puddles on the floor.

In these days of internet time, when idea to market is measured in days or weeks, it’s east to forget that not all the economy works that way. Building steel mills, cement plants, and yes, CPU silicon takes a *long* time and a *lot* of money. We have deliberately swapped money for time: we are a bootstrap startup, not looking for VC funding. There’s good and bad in that choice: a decade without a paycheck is not easy, but today we own it – all of it – and feel we got a good deal.

The proof of the Mill pudding will be when there’s a product with pins on the bottom, and that won’t happen for some years yet. We learned in our first presentation not to make projections of what the eventual chip will have for numbers. Yes, we have guesstimates internally, but we’re quite sure those will be off by a factor of two. The problem is that we have no clue which direction they will be off.

If you have the technical chops to understand a CPU design from first principles then please dig as deep as you can into our stuff and tell us – and the world – what you find. Otherwise you will just have to join us as we wait and work and see. We’ve never said anything different.

Ivan

LikeLike

Reply
- Kevin Modzelewski says:
  
  March 16, 2015 at 6:30 pm
  
  Hi Ivan, thanks for the response! I never expected my post to gain much traction let alone from the source itself.
  
  I can definitely appreciate the reasons for talking about something before it happens; we made the decision to announce Pyston and start talking about it well before it was ready. I also agree with some of the comments I’ve seen elsewhere that even if you happen to never build any chips, a fresh CPU design that’s as thought out as this will definitely influence CPU design going forward. And that might even be a large part of your goal (compared to building something just for the goal of being a commercial success), which I can definitely get behind. I think if you were a bit more clear about the state of the project and the extent to which your ideas have been validated, it could help prevent readers like me from getting the wrong idea about what you’re getting at. I think this post of mine, as well as some other peoples’ reactions I’ve seen, are somewhat borne out of feeling mislead about what exactly The Mill CPU is at the moment.
  
  But regardless, you’re attempting something crazy ambitious and game-changing, which I have to respect and root for 🙂
  
  LikeLike
  
  Reply
Lukasz Mielicki says:

April 2, 2015 at 9:27 am

It’s nice to see something new in the field but static scheduling has failed to deliver performance several times in the history. Take the i960 as an example. In the real life every fourth instruction is a jump.
Executing code in two directions? Doubles the chance of cache miss.
Single address space? Good luck with implementing fork call.
Post-compilation on the target? Fine, if you never need to debug…
Overly complicated architectures don’t seem to ever succeed and no compatibility with existing software makes me rather skeptical. Anyway, good luck!

LikeLike

Reply
- NXTangl says:
  
  February 19, 2018 at 6:50 pm
  
  Executing code in two directions may double the chance of a miss, but executing out of two separate caches lets you double I$ size without increasing latency from speed-of-light delays, which halves the chance of a miss.
  
  Also, further talks have revealed some of the things the mill can do:
  
  – Pack short loops into a single parallel instruction, yielding a loop throughput of one iteration per cycle.
  – Use smear operators to effectively vectorize while loops.
  – Use selection operators to reduce the amount of branches needed.
  – Save caller registers lazily in parallel with callee code, making functions cheap.
  – Implement fork() with what’s basically a segment offset into virtual memory.
  – A bunch of ops that make stack frames cheap.
  
  LikeLike
  
  Reply
  - Kevin Modzelewski says:
    
    February 21, 2018 at 1:37 pm
    
    It certainly sounds very impressive! If they can build half the things they promise it will be game-changing.
    
    LikeLike