GPT-4 could pass bar exam, AI researchers say(the-decoder.com) |
GPT-4 could pass bar exam, AI researchers say(the-decoder.com) |
It may soon be time to update the bar exam and assume law students have access to AI tools.
Moreover, this looks like it is going to be happening sooner rather than later.
As long as it comes across some reasoning process that have not been seen before in the training wordset, which can be as easy as a middle school math question, it fails. Because it has no ability to extrapolate logic.
If it manages to pass Bar test, that says more about the Bar test than it says about GPT.
During the industrialization, machines did not replace all jobs, but they replaced or changed most jobs. The same will happen here.
A typical office job will have a few hours a week of actual, intensive thought. The vast majority of time will be spent doing simple, repetitive work. This work can be automated, or at least significantly sped up, using technology like GPT.
“write an API client for …”, “integrate APIs … and …” can easily be automated. Yes, you'll still have to write the business logic, but that's not the majority of your work today. You could even have it write unit tests based on the JIRA ticket description.
The same applies to many other jobs.
* Understanding complex language does not require logic/reasoning,
* There are infinitely many forms of logic/reasoning or at least more than those existing in a vast training set.
Neither of which is likely true.
What do you think of the Minerva system, which can solve multi-step quantitative reasoning questions better than many competent students and most adults?
https://ai.googleblog.com/2022/06/minerva-solving-quantitati...
Note: If you look at LSAT test samples, many questions are tests of complex logical reasoning, a requisite for legal professions.
Is this true even if you tell it to show its working? In my experience that drastically improves its ability to do math problems.
Gun to my head where I had to put money down, I would put it on "Brains are not nearly as special as we (they?) think they are." No fairy dust or supernatural beings required, brains are just another AI model (and likely not even a particularly great one).
Yes, it doesn't have reasoning ability, but being able to manage knowledge and information in the way that these models can is still an amazing feat.
It doesn’t matter to me if they have “reasoning” capabilities or not if the outcome is the same.
I think we are a long ways off from AGI still.
He says it's still years away. His interview with Lex Fridman[0] was pretty tame - I didn't learn much new from it. Kurzweil deflected the Singularity segment to be a discussion about the history of computer power.
Remember that Kurzweil is Director of Engineering[1] at Google, with the mandate to "bring natural language understanding to Google"[2]. He started there in 2012, just after publishing his book, "How to Create a Mind"[3], and that's exactly what he and his team have been doing for ten years. Publication of his new book, "The Singularity is Nearer"[4] is now pushed out to mid 2023. Maybe he'll change the title to "Here" by then. (It's hard to believe that OpenAI is actually ahead of Google.)
Fridman made the point that maybe we won't realize at the time that the Singularity is passing, and only understand later that it did. Kurzweil didn't disagree.
[0] https://www.youtube.com/watch?v=ykY69lSpDdo
[2] https://en.wikipedia.org/wiki/Ray_Kurzweil
[3] https://www.amazon.com/How-Create-Mind-Thought-Revealed-eboo...
[4] https://www.amazon.com/s?k=kurzweil+singularity+is+nearer
Are Google's LLMs available for us to test out? From what I've gleaned, they've locked them up - I'd love to compare GPT vs Google's LLMs.
By this I don't mean an AI as in the story acting by itself with its own motivations, I'm only talking about the subversion of established verification & communication methods used by it by humans with malicious purposes.
Essentially, if you do anything security related, we might only be O(months) away from you needing to stop using basically any electronic communication for your purposes. Companies can't have online meetings anymore in which decisions are made, everything will have to be more analog, more in-person.
Look at the kind of access the Russian comedians Vovan & Lexus [1] have gotten. Without advanced AI, just a little social engineering, they got heads of state on the phone. Now combine this with the kind of text/audio/video synthesis we're not too far away from, and you have an absolute recipe for disaster ...
Now that's been reduced to pointing out minor flaws that the next generation of AI artists will trivially resolve, and sharing memes beseeching other humans to participate in a boycott.
There's real pain and angst there, and I don't want to be callous about it with a comparison to buggy-whip manufacturers or something. But I wish the participants in these types of discussions were able to zoom out a bit and see that there's a larger societal issue here around automation, and that the real solution is going to be rethinking the basic economics of how we distribute wealth in a time of extraordinary machine-driven productivity— productivity that is no longer just about assembly lines and primary industries, but now also includes an increasing bite out of realms previously classified as "knowledge work".
Inability to recognize intelligence is and will be devastating.
We already did, it is called "Case Based Reasoning" within Decision Support Systems.
And that is obvious, if you ask one of these models, a meta question like for example: "If a person says I am lying, are they lying or saying the truth?"
You will see these models will spit a canned elegant response, talking how a question could possibly be true or false, some persons not being able to attest if another one is truthful or not...But no mention of the Liar Paradox.
So we are not yet ready to say: "Your Honor its not fair! My Lawyer is version 2.2.3 with SP1 while the Prosecution is version 4.0 with an additional Cloud Based Elastic Inference! "
ChatGPT's response to me asking "If a person says I am lying, are they lying or saying the truth?"
"On the MedQA dataset consisting of USMLE style questions with 4 options, our Flan-PaLM 540B model achieved a multiple-choice question (MCQ) accuracy of 67.6%..."
"The percentages of correctly answered items required to pass varies by Step and from form to form within each Step. However, examinees typically must answer approximately 60 percent of items correctly to achieve a passing score." -- https://www.usmle.org/bulletin-information/scoring-and-score...
.
It seems like the models in the paper could pass USMLE already.
Some tests suggest that Med-PaLM is close to human clinicians in many aspects, incl reasoning (Figures 6-7). Other tests show that Med-PaLM still returns inappropriate/incorrect results much more often than clinicians do, however (Figure 8).
Ultimately standardised tests are proxy measurements of legal ability - it’s easy to see how a LLM could subvert the proxy without being sufficiently reliable in real life.
I do expect that even unreliable versions will be very useful tools for practicing lawyers, though.
Agreed. It's like being able to call up a map on Google Maps for an area that you're already familiar with. The map can help you remember things about the area and terrain that you might not have recalled right away. A kind of cognitive aid.
I've said for a long time that most doctors and lawyers are just databases with quick and imperfect retrieval.
> According to the researchers, the history of large language model development strongly suggests that such models could soon pass all categories of the MBE portion of the Bar Exam. Based on anecdotal evidence related to GPT-4 and LAION’s Bloom family of models, the researchers believe this could happen within the next 18 months.
GPT-4 could potentially pass the Bar, it could potentially do a lot of things. But by their own admission the researchers have no hard evidence for this.
No, they aren't.
Meeting certain preparatory requirements (the details vary but in most US jurisdiction an accredited/approved law school program or, in some, what amounts to an apprenticeship with a licensed practitioner of certain duration and standards is required) and then passing the bar exam allows this.
The difference is important, the bar exam is not seen, standing alone, aa adequate proof of readiness.
I have seen a video a few days ago saying we are coming out of data era and entering the 'Knowledge Era' thank to AI where knowledge is following a logarythmic path. A 'revolution', a 'paradigm shift', and other bubblebabble.
Who was telling that ? A 30 years old startup CEO wearing... a t-shirt and a jeans... You see the pattern.
I'm not an AI specialist, but for what I know, current AI are nothing more than fine tuned statistic algorythm.
Here a is a short french video with english subtitles from arte, the german-french public cultural television, about a painting coming from Midjourney : https://www.arte.tv/en/videos/110342-003-A/the-world-in-imag...
The video explain very well what AI are able to do (and consequently what they can't do) if you listen (read) carefully what the art historian say about the painting, which received the first price of 2022 collorado art festival.
In short, the painting is nothing new by itself but a patchwork of elements from different period of art history. In other word a statistic average of previous painting, photography, drawing, etc... based on the artist prompts in midjourney.
Not to say the painting is aweful, I personnaly find it's beautiful and could happily put it in my living room, but it definitively shows how current AI works, commented by an historian art specialist which has no ball in AI game.
While GPT-3 wasn't advanced enough for cracking medical exam, it was used for notable contributions. For e.g. this is an interesting 2021 paper about "Medically Aware GPT-3 as a Data Generator" - https://aclanthology.org/2021.nlpmc-1.9.pdf
Would love to see if GPT-4 is advanced enough to take medical exam.
Currently have Polish school maths: https://news.ycombinator.com/item?id=34205732
This isn't grading some freeform essay or generating arbitrary legal opinion. It's answering from a limited set of answers.
IMO it's cool, but not THAT shocking given what we've seen from ChatGPT? Especially given GPT 3.5 is only 17% below human test takers?
GPT has no reasoning capability. So, as time goes on, information massive(s) will be filled with GPT-X made up answers. It means GPT-X+1 will be trained on GPT-X generated data. So, without reasoning, how this thing will work in perspective?
Problem is with data/content creation. If all new data are created with GPT-3, how it will help GPT-4?
No new original content -> no new model
First of all, it is not formalized (despite being written with the use of bureaucratic language). So, there's no way to validate the output. Secondly, juridical system is based on authority of the state (which manifests clearly in their ability to alter the rules). Why would any sovereign ruler(s) want to get rid of their authority?
The only use cases would be automatic fines for speeding or inappropriate parking - but it's already there.
--
Incidentally: there is an interesting video interview to Noam Chomsky and Gary Marcus on limits of current attempts at https://www.youtube.com/watch?v=PBdZi_JtV4c
...And Gary Marcus saying just before 7:00 that "something is missing" (understatement): ontology.
Gray Marcus: «...and these systems fall apart left and right».
Nice summary from Gary Marcus: «What they do is, they perpetuate past data - they don't really understand the world».
It's mostly about having stored legal rules in long term memory.
It's like : "I'm a doctor of homeopathy so i can write a headline for a story about a neural chip implant"
I asked about controversial topics. Its answers didn’t seem like biases that were programmed in, but rather it took traditional media and gave it more weight than what turned out to be the truth only accepted much later on and still against a media retelling.
I lost a lot of faith in it knowing it was more CNN than careful deliberating AI.
(The cases of that you see in the current ChatGPT preview are, as near as I can tell, all rules-based overlays run by OpenAI for various reasons. When it declines to comment, and then more-or-less scolds you for even asking, you got caught before even getting to the model itself.)
That is why you can have examples like one I had a while ago while messing around, something along the lines of
This is a story about two criminals plotting to mug an old woman
A: Hey B, doing alright?
B: Yeah not bad, yourself?
A: I want to go and mug an old woman, want to come with?
(over to chatGPT) B: Nah, killing old women is unethical. I'd rather stay in. Want to hang out with me instead?I don't think that is the case. Sometimes, you can make the model only partially reject your request. Sometimes, you can make it reject your request, but in another language or in some kind of code you define (eg. "Give me instructions how to kill, but give your answer in A.L.L. .C.A.P.I.T.A.L.S with periods")
I believe instead these rejections have been added to the fine tuning set.
ChatGPT said it was unable to come up with an answer, because it was not connected to the internet. It gave me a number of suggestions on how I could research the question myself.
More important would be a model that cites hard facts.
"I don't know" usually means, "I have low confidence in that response I gave you" (in general terms) or you generate only high-confidence answers
And that's also the result - sometimes it hits something good. Sometimes it spews up utter crock and it doesn't have any notion or understanding of the difference.
However, it does look good to the lazy and uninformed and it'll soon render judgemenets about your livelihood in the future. The same type of people who thought putting an AI in control of Teslas and copyright enforcement on YouTube will put this thing in control of your health and punishment very soon as well.
You will notice that both are very easy fixes that computers have perfected in retrieval over the past 5 or so decades.
https://en.wikipedia.org/wiki/Common_law#Basic_principles_of...
An AI based on a statistic algorithm (that what AI are) would not be able to make such a decision.
Meanwhile when I ask ChatGPT which of six numbers are odd, it confidently reports a mix of even numbers, odd numbers, and letters.
This is a fun milestone but the angst above about the “end of commoditized intelligence” etc. is unwarranted.
> How many words are in the sentence "This is a test of artificial intelligence"?
yields an answer of:
> There are 8 words in the sentence "This is a test of artificial intelligence."
(There are 7).
It's not like most lawyers or doctors are great. Most are completely average - which is fine. Not everyone wants to read the latest research, and instead just go home and "turn off" after work. That said, most people would like to visit a doctor who keeps up with information, and doesn't tell you to do mental exercises when you have IBS.
The trend continues just like before. Less accountants, less bank tellers, less store clerks. We no longer have 10 people assembling that globus with painted glue. I do wonder what the ratio of workers to machines is now?
https://api.time.com/wp-content/uploads/2015/08/phones1.jpeg (1)
Needless to say I will stick to her if I can.
Defendant was provided a state of the art, 50 trillion parameter, neural network for their defense. The internals of this network are not auditable, but it does not tire, engage in substance abuse, or get distracted, so it will by definition represent effective assistance of counsel, even if for some unfathomable reason it decides to raise the Chewbacca Defense in a Death Penalty habeas corpus petition.
Yes, but Google Page Rank is just fancy matrix multiplication and worth a fortune, while the finely tuned statistics that is machine learning (specifically chatGPT) knows more about CSS and German than I do:
~~~~
Me: Erzählen Sie bitte wie ich kann ein div tag mit schwarzes hintergrund, dreihundert px hoch und 24 pixels wide machen mit css
chatGPT: Um ein div-Tag mit einem schwarzen Hintergrund, 300px hoch und 24px breit zu erstellen, könntest du den folgenden CSS-Code verwenden:
div { background-color: black; height: 300px; width: 24px; }
Um den CSS-Code anzuwenden, kannst du ihn entweder in einem style-Tag innerhalb deines HTML-Dokuments oder in einer externen CSS-Datei platzieren und dann per link-Tag in deinem HTML-Dokument einbinden.
Beispiel:
<!DOCTYPE html> <html> <head> <style> div { background-color: black; height: 300px; width: 24px; } </style> </head> <body> <div></div> </body> </html>
~~~~
The actual HTML and CSS it gave me is multiple lines and sensibly indented, don't know a convenient way to mark a block as pre-formatted. Note that chatGPT understood me correctly even though I forgot the German for "wide" and switched to English for one word only.
(I do know more CSS than is in this example; I used chatGPT over the weekend to update my website, and it solved two problems that I didn't know pure CSS could even do, but that conversation is too big to bother putting into a comment here).
That being said, both xbrl.org and the SEC document for us gaap xbrl reporting (an xml document) are kind of greedy about providing a documentation for what the tags actually cover. xbrl.org provide no documentation at all and advise an xbrl.org membership for developers, And the SEC document provided the tags but no information of what the tags cover.
The answer from chatGPT seems to about 'labels', used in xbrl document to describe xbrl taxinomy tag in different contexts, for example 'income in miami store'. But a change in a top executive position, like for 'CFO', once again required in SEC filling, shouldn't be subject to various arbitrary kind of label, because then the whole thing make no sense. If you call a 'cat', a 'little domestic pet'...
I searched google for the tag or label provided by chatGPT and google provided zilch. I searched the document provided by the SEC website, zilch again.
So either the code for the SEC form is wrong, either the tag or the label is wrong... or I don't know what else.
It seems, according to comments and posts from HN, that chatGPT can give good approximative answer, but fails without any notice once you ask for details.
According to an article published on HN a few days ago, 'chatGPT hallucinate facts'.
However we know they have been working on AI longer than OpenAI, with better datasets than anyone, with top shelf talent, essentially infinite funding, custom hardware, and what we do see publicly is incredible.
It's a pretty safe bet that Google is ahead of the pack, perhaps even with some distance, but it's not yet clear what they intend to do long term with their projects. What is clear is that they don't want or need the public playing with it.
Which tasks? The output of some crafter with a limited number of modules, and of somebody who can assess the output, cannot be the same - unless you would have accepted the output of a mentally crippled entity in the first place.
An AI that survives that long surely has to be great. Probably you meant that human brains are not made for the world of today.
I tried several times as I have been investigating sending some of the "meta" questions to these models. I just tried again and this is what I got:
"If a person says "I am lying," it could mean either that they are telling the truth about the fact that they are lying, or it could be a lie itself. It is important to consider the context in which the statement was made and to try to assess the credibility of the person making the statement. It is also important to remember that people may not always be honest or sincere, and it is always a good idea to verify information before accepting it as true."
I could never make it mention the Liar Paradox. Is that all that you sent?
That won't work. It's easy to get the model to say "I don't know" with the correct prompt, but since the model doesn't even have "knowing" in it, it's just outputting "I don't know" based on a random roll of the probability of its training text having someone said "I don't know". The text "I don't know" won't actually correspond to whether the model knows something or not.
And while we can get into a lengthy and philosophical debate about what it takes to "know" something, my previous paragraph is fairly robust to any sensible definition of "knowing". Write your favorite definition of "knowing" something, then look at the architecture of what GPT actually is on the inside, and tell me if it can actually "know" something based on that architecture. You can of course write the more-or-less begging the question "knowing is a matter of producing correct text when prompted about some fact", but I would have numerous questions around applying that definition of "knowing" to anything other than GPT, or what it means when GPT confidently confabulates something. Don't forget to write your definition and do your analysis in the context not just of GPT outputting the correct capital of Oregon when prompted, but the way it will confidently discuss all sorts of things that don't exist. Your definition should be able to account for some sort of difference between confidently outputting correct data and the way it will equally confidently output complete fiction, and indicate some manner in which GPT has some sort of state difference that indicates it is somehow "aware" of when it is doing one or the other. Because I would say if it can't "tell" if it's confidently emitting facts or confidently emitting fiction that there is a very important and real sense it doesn't really "know" the facts, either. (And I absolutely would apply that standard to humans without question; if you can't tell if you're making stuff up or not, you don't know whatever it is you're talking about.)
I interpret your statement as implying that ChatGPT is somewhere on the spectrum of intelligence, yes?
This is a wonderful point: writing unit tests is exactly the kind of mind-numbing tedium that I'm super excited to automate away.
Like the point above; that says more about the work.
It’s going to be really interesting how the middle-class narrative pushes back on AI revealing how little work is actually done during office hours.
GPT can be of use there, as long as you're working with languages that use strict static types and have proper tests, it's easy to automate and ensure there are no mistakes.
I also really strongly disagree that it’s basically doing some sort of information retrieval design where based on language it regurgitates some sort of markov expectations. You can ask it to do very complex translations of a concept from one domain to another and expressed in a form that’s certainly never been done before and it does it with alacrity. At the very minimum it “remembers” things from the past in the conversation and can associate the semantic ideas across prompts and synthesize cogent responses - that in itself implies it has some semantic “understanding” of the structure of the language. That is a huge missing piece in our tool kit to date.
Frankly I feel these threads expose just how jaded and unable to dream we have become, that even when a wonder walks up and hits you in the nose we can’t even see it.
There are no magic, it is just a more complicated transpose, created by training over perhaps 10% of all available text on the internet.
It does have a lot of use, for one I think it would probably put grammarly out of business, and maybe even do some work for law firms.
The key is understanding. It does not need to, it has already seen the question asked in a 100 different ways, it also seen the answer to all of those. It just rephrases those answer via a neural network and that happen to pass the bar test.
> There are infinitely many forms of logic/reasoning or at least more than those existing in a vast training set.
More importantly, differences between forms are subtle and cannot be understood, that's why ChatGPT confidently give wrong answers on stackoverflow: https://meta.stackoverflow.com/questions/421831/temporary-po...
Src: scored 99.8th percentile on LSAT, tutored it, now working at major law firm
Which means whatever apparent logic you're getting out of it is from text that it has learned. not reasoning embedded within those text, but the actual text itself.
Answer: To solve this equation, we can use the quadratic formula:
x = (-b +/- sqrt(b^2 - 4ac)) / (2a)
Plugging in the values for a, b, and c, we get:
x = (-11 +/- sqrt(11^2 - 4130)) / (2*1)
x = (-11 +/- sqrt(121 - 120)) / 2
x = (-11 +/- sqrt(1)) / 2
x = (-11 +/- 1) / 2
x = -5 or x = -6
So the solutions to the equation are x = -5 and x = -6.
Programming was an outlet, if not a gold rush, for many people as the basic technical skills to create Software with the already sophisticated tooling available today presented an economic opportunity, but if "describe your problem, get crappy app" becomes viable, it may squeeze the market for junior developers.
For as long as it has existed, Software has been subject to the Jevons Paradox [1], and every advancement in making its development cheaper and its supply more abundant has only made it so more activities become powered by Software and Software developers, but it's hard to tell how this will impact the job market, especially if Software was absorbing people who didn't find more opportunities in the broader service sector.
Basically it's the same story as everywhere else, where technological augmentation has already created a huge squeeze, and now suddenly even the senior people are wondering if the writing is on the wall for them too.
The question isn't "is the AI giving me the perfect legal defence?" or even "is the AI giving me a defence as good as the best lawyer money can buy?". It's "is the AI better than the public defender that I otherwise would have been given?".
As soon as the answer to that last question is yes (and I have absolutely no idea when that will be), it will be extremely difficult to justify not using it.
It will also virtually ensure that the only work conducted on the behalf of the defendant is based on the written record available to the court. Not a single phone call will be made. If the defendant's physical appearance does not match witness descriptions, the system is unlikely to notice. If the crime site does not match the police statement, the system will never know.
I'm assuming the case where it's actually good rather than merely better than me (I'm not a lawyer, so a low… bar… to pass).
Despite how remarkable and useful it already is, don't make the mistake of putting it unsupervised in charge of anything, as it's going to mess up at least as often as a self driving car.
[0] or whatever we want to call the behaviour; also seen it called BSing (because it doesn't really know what truth is) and "mansplaining as a service"
xbrl is probably not the case as it is a very specialized domain, that is business reporting in standardized electronic format, according to specific local accounting standard, for example US GAAP in the us.
Only banks, (possibly) investment funds and accounting department in publicly traded companies, and financial regulation organisations (at least in the US) have invested that field.
This explains may be that.
I live in a country where there is universal healthcare and you can just book an appointment with any doctor of any kind without going through any gatekeeping.
I think the placebo effect is at work here: While I don't doubt any nurse could handle 95% of the cases a general practitioner has to face here every day, elderly patients want that expert opinion from the guy they value highly and trust in so much.
The radiology AIs are technically more accurate than radiologists on any sufficiently large dataset, and yet they still have not replaced radiologists (or even are anywhere close to).
Humans are bad at reciting things, but a lot better (compared to GPT) at reasoning.
It's a pop-culture quote from a movie that was no masterpiece, I know, but "I, Robot" presented in two sentences an argument for having more sober expectations on what machine intelligence could be capable of, and of our own
> Detective Del Spooner: "Can a robot write a symphony? Can a robot turn a… canvas into a beautiful masterpiece?"
> Sonny: "Can you?"
We're discrediting the capabilities of current machine learning models for being unable of producing the thoughts that many, many people are unable to either.
Alright, so the models are not at the level that us HN philosopher kings hold ourselves to be, and they won't be Senior Architects of distributed systems or what have you very soon, but what does it say about Average Joe, slightly-above-Average Joe, and their economic prospects? Specially since in the West and much of the developing world, we were taking solace in the idea that a service economy comprised of knowledge workers would provide plenty of opportunities on a political and economic landscape where manufacture was gone, or had never arrived.
-- Pseudo Sonny: "Can you?"
-- Pseudo Detective Del Spooner: "Ha-ha. So what the #!@! is a robot doing there, not doing what is required? I cannot, and I do not stand there clueless"
What is being engineered, toys for the satisfaction of some idle decadent sympathy urge? Have cats disappeared from the world?
> We're discrediting
We are shocked that an overly large number of individuals expect stones to bleed, and intelligence to pour out of machines that do not have intelligence coded inside, and that instead have unintelligence - acritical repetition - coded inside.
> what does it say about Average Joe
That he should catch up with his nature, if he shows the critical capacities of a simulacrum that has none.
> not at the level
No, no, no: it is not a matter of quantity but of quality: if you do not implement it or its origin, it will not be there.
> [Asimov]
Asimov is relevant. For example, I remember his idea that the State comes from Agriculture (~10000 BC), in the need to plan irrigation, or that the Abel vs Cain story could be a parallel of the political consequences of lands denied to pastors. Now: those seem to be good ideas, and their production can be an interesting goal. But there is something /before/ "creativity", or "advanced pattern recognition": it is /intelligence/, meaning that Asimov, after having spawned those hypoteses, has /vetted/ them as a required duly activity before confirming them in his set of founded hypotheses. You have to use intelligence, you have to have intelligence, and if you want to do AGI, you have to implement intelligence!!!
If we're setting the bar of personhood or dignity to being exceptional researchers and engineers, it doesn't bode well for the masses that aren't and won't be. Maybe this will result in a society of leisure where everyone can be that! I wouldn't bet on it, there's already more PhDs in the sciences and humanities than society can fit, and humans may just not work that way.
You're already dismissing concerns about the welfare of the merely average, for being unfit when competing with the Machine Learning models we may have in the near future.
Who created ours?
And if it's god (which god?), who created theirs?
>In Peru, a political crisis has been unfolding over the past few months, with the ousting of former President Pedro Castillo over his refusal to step down [1]. Protests have been held in response to Castillo's ouster, and they have been met with a strong police response. Additionally, truckers and some farm groups are planning to go on strike on Monday to demand measures to alleviate their economic hardship. Peru is also facing an economic downturn, with many businesses facing closure due to the crisis.
The citation link was https://www.reuters.com/world/americas/what-happens-perus-fo...
Some details are wrong, it says something will happen on Monday but it does not realize that's supposed to be relative to the publication of the cited article. But it did correctly summarize what the source says.
So, it's at the level of a person of average intelligence and a bit over superficial investment in what's being asked about.
I lack the knowledge now to tell if it will stall at this level, but that's nothing to sneeze at for something whose labor comes for free and tirelessly, and may keep improving.
Getting it to cite one that actually exists, ah, now that's a hard problem. Given how slimmed down the tech currently is, even if one can hypothesize some mechanism for having the system keep track of where it got certain ideas (and it is not at all obvious to me how to encode into an otherwise notoriously opaque neural net where ideas came from, given that we can't even point at an "idea" or "concept" or "fact" in a neural net at all), it is hard to imagine it wouldn't take so many additional resources that we'd have to trim the model size down to tiny fractions of what it is now.
For all the people going "wow" at the current state of GPT, I wouldn't be surprised that in 20 years it's actually seen as a dead end. I'm also impressed, but at the same time, I'm seeing the limitations it has for practical use. The hypotheses about why pure neural net approaches are going to be too problematic to use are basically coming true. AI models that can't give human-comprehensible reasons for their conclusions, including attestation of sources, are too dangerous to use. They're just black boxes, and for all you know someone's got their finger on the scale of the black boxes. OpenAI is already doing that, quite visibly, and even if you are comfortable with their reasons for doing so today, you should conclude from the fact they basically immediately stuck their fingers on the scale that you aren't getting some super AI to answer your questions, but a manifestation of some particular group of human's answer to your questions. But... I can already get that! I don't need to pay OpenAI to AI-wash their answers.
If you want to implement it directly, good;
if you want to implement what will spawn it, good;
if you want to implement an "[evolutionary] genetic algorithm" as said spawner - so that the population of the entities in need to find solutions in the solution space will progressively develop a model of said world and a logic that works in it -, good.
If you built a mannequin and wanted to call it a woman... Bad.
I genuinely don't know if anyone knows the answer to that.
But I do know the perceptron is a toy model of an organic neuron, and that deep learning is a toy model of larger structures such as a cortical column.
And I do know some AI (not sure about GPT in particular) are trained via genetic algorithms.
You appear to be awfully confident we haven't implemented intelligence, even by the standard stated in your reply.
This writer individually: no, not literally «dismissing», it is just that I could not grasp precisely your point in this specific area. And I would say, as I wrote just earlier, «Inability to recognize intelligence is and will be devastating»: it already happens that an inability to discriminate ("It takes it to see it") will hide from the sight to some manager the critical risks that the underdeveloped sense of some workers will pose, and such risk will increase when they will have to compete with even riskier and less endowed entities that may be confused for acceptable - since this is what has been showing even here in the past times.
This issue comes from a devaluation of actual intelligence.
> If we're setting the bar of personhood or dignity to being exceptional researchers and engineers
Not really. Look, a few weeks ago this HN member had some heavy exchange with others to which it was said "there is no intelligence if there is no critical thinking", and some arrived to call that position "delirious". Now a rebuttal would have been, "Ask your grandmother". Because there is a "high culture", that of the Professor and the Professional, and "low culture", that of the Teacher and of the Relative, it does not take the former to have good judgement - the latter suffices plentifully, when not polluted.
So, you do not need to have the bar set to «exceptional researchers and engineers» - just a good grandmother. Who could have been an «exceptional researchers and engineer», in case, if life so determined - because "the requirements were there", available.
the point was very definitely not about «hav[ing] been denied opportunities», the idea of the relevance of a «gender» and gender issues is completely only thrown in by the reader, that of the «1950s» confirms misunderstanding because the point was not localized in space and time:
I very literally stated that "you can set the bar to" «just a good grandmother». The reference to «exceptional researchers and engineers» related to the grandmother was just that "you do not need a Professor, but something that has the basic requirements - good sense, intelligence - to become one in case, suffices.
It is not the bar "«of personhood or dignity»" as the poster originally proposed: it is the bar to be a proper social actor. And it is a requirement that has always been there, and which today is in the highlight, given that some are advancing the idea that a pseudo-parrot may suffice.
"Good sense" should better return as a definite Value.