I’ve recently been obsessing over my commuting options, trying to figure out the optimal route. It’s a little nuanced in that it matters where and how you park so you can get on the train, etc. (You could even get into the reliability of train vs. bus, traffic, frequency of trains, etc.) It’s the kind of exercise that I think resembles very basic analytics work. I asked ChatGPT for its thoughts on which commuting option from A to B was best, and it was awful. You’re better off just getting directions on Google Maps and taking the first one.
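For what it’s worth, the “basic analytics” version of what I mean is roughly the sketch below. Every number is invented for illustration; the only real modeling assumption is that if you show up at a stop at random, your expected wait is about half the headway.

```python
# Toy commute comparison. All times are made-up minutes, purely illustrative.
# Expected wait for a random arrival ~ headway / 2; unreliability is folded
# in as an expected delay.
options = {
    "drive + park + train": {"drive": 10, "park_walk": 5, "headway": 15, "ride": 25, "exp_delay": 3},
    "bus":                  {"drive": 0,  "park_walk": 5, "headway": 20, "ride": 40, "exp_delay": 8},
    "drive all the way":    {"drive": 45, "park_walk": 0, "headway": 0,  "ride": 0,  "exp_delay": 10},
}

def expected_minutes(o):
    # Door-to-door expected time for one option.
    return o["drive"] + o["park_walk"] + o["headway"] / 2 + o["ride"] + o["exp_delay"]

for name, o in sorted(options.items(), key=lambda kv: expected_minutes(kv[1])):
    print(f"{name}: ~{expected_minutes(o):.0f} min expected")
```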
I also learned that if you ask it to show you a picture of someone writing with their left hand, it shows you a picture of a right-handed person and lies about it.
The problems you mention seem like they could be “engineered” away. They are basically problems of overfitting. We can ease them by getting more data, so that we have separate training and testing sets, and by using regularization or Bayesian methods to prevent overfitting. Sometimes this will work, but not always.
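To make that concrete, here is a minimal numpy sketch of the two ideas together: hold out a test set and add a ridge (L2) penalty. The data and numbers are toy values I made up; the point is just that the unregularized fit can look fine on the training points and fall apart on the held-out ones.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy overfitting setup: few noisy samples, many polynomial features.
x = rng.uniform(-1, 1, 30)
y = np.sin(np.pi * x) + 0.3 * rng.normal(size=30)
X = np.vander(x, 15, increasing=True)            # degree-14 polynomial features

train, test = np.arange(20), np.arange(20, 30)   # hold out the last 10 points

def ridge_fit(X, y, lam):
    # Closed-form ridge regression: w = (X'X + lam*I)^(-1) X'y
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)

for lam in [0.0, 1e-3, 1e-1]:
    w = ridge_fit(X[train], y[train], lam)
    mse = np.mean((X[test] @ w - y[test]) ** 2)
    print(f"lambda={lam:g}  held-out MSE = {mse:.3f}")
```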
For example, it doesn’t cover the situation where you build a model that lets you understand the problem better, and then realize you need fundamentally different kinds of data.
We can also consider why we prefer Newtonian gravity over Ptolemy’s model. Both use the same training data, and both can be falsified. But the universality of Newton’s laws provides a different kind of preference, particularly when considered together with the various observations of changes in the heavenly bodies (the supernova observed by Galileo, for example).
I’m not sure about general everyday ‘equation solving’, but Google DeepMind used a similar notion to improve some algorithms; see AlphaTensor and AlphaDev. The improvements are small, but the algorithms are used frequently. Obviously, AlphaFold was the bigger thing to come from DeepMind lately.
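For a sense of the kind of thing AlphaTensor was searching for: the classical example is Strassen’s trick, which multiplies two 2×2 matrices with 7 scalar multiplications instead of the naive 8. The sketch below is just the textbook Strassen identity (not anything AlphaTensor found), but it is the same species of object: a decomposition of the matrix-multiplication tensor with fewer multiplications.

```python
import numpy as np

def strassen_2x2(A, B):
    # Strassen's 7-multiplication scheme for 2x2 matrices (naive method uses 8).
    a, b, c, d = A[0, 0], A[0, 1], A[1, 0], A[1, 1]
    e, f, g, h = B[0, 0], B[0, 1], B[1, 0], B[1, 1]
    m1 = (a + d) * (e + h)
    m2 = (c + d) * e
    m3 = a * (f - h)
    m4 = d * (g - e)
    m5 = (a + b) * h
    m6 = (c - a) * (e + f)
    m7 = (b - d) * (g + h)
    return np.array([[m1 + m4 - m5 + m7, m3 + m5],
                     [m2 + m4,           m1 - m2 + m3 + m6]])

A, B = np.random.rand(2, 2), np.random.rand(2, 2)
print(np.allclose(strassen_2x2(A, B), A @ B))  # True
```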
But yes, chatbots can’t reason. They can only try to predict the words we use when we talk about reasoning. (I think humans often do this as well, though presumably not all of the time.) They can also try to translate their words and ideas into code and then run the code, but they aren’t great at that either.
The newest generation of models that do pretty well essentially work through trial and error. It’s not quite simple brute force, but it is a lot of force. One part of them generates ideas; another part critiques those ideas. The two together get pretty far in problem solving, though it’s getting computationally ridiculous.
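A cartoon of that generate-and-critique loop is below. Everything in it is a stand-in: the “generator” just guesses random candidates and the “critic” is an exact checker, whereas in the real systems both sides are learned models. But the shape of the loop — propose, score, keep the best, stop when a check passes — is the trial-and-error I mean.

```python
import random

# Toy problem: find an integer x with x**3 - 6*x**2 + 11*x - 6 == 0 (roots: 1, 2, 3).

def critic(x):
    # The "critic" scores a proposal; 0 means the proposal passes the check.
    return abs(x**3 - 6 * x**2 + 11 * x - 6)

def generator():
    # The "generator" proposes candidate solutions (here: blind guessing).
    return random.randint(-20, 20)

best, best_score = None, float("inf")
for attempt in range(1, 10_001):
    candidate = generator()
    score = critic(candidate)
    if score < best_score:
        best, best_score = candidate, score
    if score == 0:
        print(f"found x = {candidate} after {attempt} proposals")
        break
```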
OpenAI’s latest paper had a model that used more than $1,000 of compute per problem.
I can’t speak to genius. My limited experience with creativity is that there’s also a lot of trial and error. You experimentally combine ideas, images, interpretations, equations, definitions, theorems, whatever. Then you test them to see whether your new ideas are any “good”.
I think chatbots might be able to do those things one day, but we’ll have to wait until they can at least solve already-solved problems.
It’s surprising to me that Fortran was faster than C++ for numerical work until the mid-2010s or so. Apparently this is because the more limited expressiveness of Fortran allowed for additional optimizations to matrix calculations. C++ caught up using expression-template techniques. AlphaTensor reminds me of a fancier version of that. It’s interesting to hear about; I don’t think I knew about that particular case.
Mathematica can do more general symbolic calculations. I hope a deep-NN speedup is coming soon (or maybe it’s in the latest version, which I haven’t seen). It would be more broadly applicable.
A deep problem is that we don’t fully know what reasoning is.
A popular AI textbook tries to pragmatically define intelligence as “doing the right thing”. Unfortunately, nobody knows what the right thing is. Mathematicians will tend to have their own idea, influenced by the values that helped them become successful in their field.
One thing is for sure: it isn’t modeled on how people actually think and make decisions.
The reason I bring it up is that we have statements like the one made by Elon Musk a few days ago (as I recall) that, probably by next year, we will have a computer that is more intelligent than any person currently alive. I think it is fair to respond to claims like that with references to genius.
It makes me wonder what kind of “intelligence” Musk and his kind are really looking for. True genius tends to be terribly subversive of existing power structures: academic, political, religious, artistic, etc. Maybe it should be no surprise if so many of these AI execs envision perfect intelligence as a kind of perfect drone worker: something that can take data and, without any motivation or values of its own, return a product that can be sold.
GothamChess is covering a chess tournament with Stockfish, Martin, and six AI engines. The first video was posted today, featuring Stockfish and Snapchat. Snapchat made many illegal moves, but Levy let them be played. Fun watch. https://www.youtube.com/watch?v=CZGs4g_hVco&ab_channel=GothamChess
It was funny to see Snapchat move the king onto the same square as the bishop, move the knight twice in one move, have the queen move like a knight, etc.
Like playing with a kid who knows grandpa won’t say anything. Then the kid throws the board into the air and stomps off after losing.
Will this AI grow up to be a brat, or will it mature? This AI might have digested too much Calvin and Hobbes. No, not the philosophers.
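For anyone curious what a rules engine would have said about those moves, here’s a quick check with the python-chess library. The particular squares are just my own example of a queen jumping like a knight; any standard engine or GUI does this kind of legality check before a move is ever played.

```python
import chess

board = chess.Board()  # standard starting position

# A queen jumping like a knight (d1 -> c3), the Snapchat style of move.
print(board.is_legal(chess.Move.from_uci("d1c3")))  # False
# A normal opening move, for comparison.
print(board.is_legal(chess.Move.from_uci("e2e4")))  # True
```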