ChatGPT for Teams

436 points by szermer 2 years ago | 418 comments

Announcement post: https://openai.com/blog/introducing-chatgpt-team

sholladay 2 years ago |

I want ChatGPT for Family.

The free version gets a lot of use around here but the most powerful feature is the ability to search the web, which is only available to paid users. I pay $20/month for myself and I’d happily pay a bit more for the whole family, but not $20/month per person - it adds up. Family members end up asking to borrow my phone a lot to use it.

Give me a 3-4 person plan that costs $30-$40/month. You’re leaving money on the table!

freediver 2 years ago | |

At Kagi we plan to offer this for $20/mo and 6 family members included. You get both paid search (much better than openai bing) + AI (gpt-3.5-turbo/claude-instant). If you need gpt-4 it will be an optional $15/mo upgrade per family member.

idiotsecant 2 years ago | | |

This is not a comment on the Kagi service, but more a comment on transitions in general. I have tried Kagi and I think it's great. I really want to use Kagi. I want to support Kagi. I have a mental stickynote that says 'start using Kagi on everything'. Every time I sit down to do some tasks it just falls to the bottom of the to-do pile because I feel like there's so many devices I now need to go through and update. Google really has a powerful advantage by bundling search in with the browser product. Isn't that what got microsoft into anti-trust trouble? How is it allowed?

uneoneuno 2 years ago | | |

I've been playing around with the assistant stuff and adding !expert to my searches to see what the LLM spits out first as a quick check. I'd love if I could get my custom assistant to work - sounds like a lot of fun to be had there.

fouc 2 years ago | | |

is gpt-3.5-turbo/claude-instant better than the model that free tier of chatgpt uses? FWIW, from my testing dolphin-2.5-mixtral-8x7b was clearly better than free tier chatgpt.

freedomben 2 years ago | | |

Nice, I'm looking forward to that! You guys have some pretty outstanding AI chops going. I've been really impressed!

twodayslate 2 years ago | | |

Sounds great. Will that plan also have access to the Search API which is currently restricted to Teams plans?

hmottestad 2 years ago | |

I haven't found the web search feature particularly useful or helpful. Far too many sites are blocking the ChatGPT bot. I also find that ChatGPT isn't getting any better search results than I would if I searched for something myself. Quality of the results varies a lot too, and ChatGPT doesn't really seem to be able to distinguish between high quality content and not so high quality content.

For software development I find that Phind is pretty good at combining search results with GPT-4 in a way that increases the quality of the result.

Maybe OpenAI can convince the Bing team to index everything using their embeddings. If ChatGPT could also read the text directly from Bing instead of having to "surf the web" it would be able to consume several search results at the same time. In the future I could even see Bing et al. running an LLM over all text when indexing a page to extract info and stats like a summary, keywords, truthfulness, usefulness, etc.

freedomben 2 years ago | | |

Same experience. 90% of the time I ask it to summarize something, it can't because it's blocked. At least it has the decency to tell me that it's blocked rather than just failing (which is what Kagi does. Love Kagi, but that's a minor improvement they could make).

This is where I suspect Bard is going to be an absolute beast of a product. The ability to quickly and thoroughly consume a bunch of hits, find the best, and summarize them is something Google (and increasingly, Kagi) is uniquely positioned to do.

osigurdson 2 years ago | | |

I feel that LLMs have the potential to reorganize the web. Instead of being ad sponsored, raw, high quality data will be priced and aggregated.

rachele_ 2 years ago | | |

A workaround for this is to print the site as a pdf and upload it to GPT.

lhl 2 years ago | | |

ChatGPT's web search is interminably slow and I've added to my custom prompt to not do web searches unless explicitly asked. However, I'd give Perplexity.ai a try - I've found it to be incredibly fast and useful (funnily enough, they largely also use Bing for search retrieval results) and if you pay for Pro (which I do now), you can also use GPT-4 for generating responses.

ringofchaos 2 years ago | | |

I also had some good experience with the default free version of Phind. I was facing an issue in a Python framework, which turned out to be a bug. Phind was able to pinpoint the GitHub discussion where the issue was raised and also suggested a workaround code example based on the GitHub issue. No other free AI tools were able to do this.

px43 2 years ago | |

I have a custom GPT for telling my 3 year old bedtime stories. It's super cute to listen to the two of them collaborate back and forth where my kid will add new characters (friends from school, or stuffed animals) and new wacky twists to their adventures, and the storyteller GPT will come up with a new revised version.

It would be pretty rad if she could just have the app on her tablet with a family plan. She doesn't use it quite enough to justify getting her own subscription, but especially if we could share GPTs across devices, so she gets the ones I make for her, but doesn't get flooded with my work or research related GPTs.

krzyk 2 years ago | | |

Oh, how does your 3 year old interact with GPT?

BTW, I once read that someone automated the generation of bedtime stories (with his children as the main characters) using the OpenAI API and speakers. I was quite amazed (not a thing I would do, but a nice use of GPT).

taylorhou 2 years ago | | |

ummm how do i get this? i've got a 5, 3, and 1 year old and would love this

siva7 2 years ago | |

I'm certain that they will soon release anything that promises more subscriptions. ChatGPT for Family, ChatGPT for Gov, and so on...

whycome 2 years ago | | |

ChatGPT for Kids™

worldsayshi 2 years ago | |

There seems to be a plethora of somewhat ChatGPT-competitive alternatives that do search the web at this point though. Maybe try phind.com?

(Although I haven't yet myself tried any alternative that is clearly on par with ChatGPT 4)

unnouinceput 2 years ago | |

Can't you use the same account on multiple phones though? I thought this is a no brainer.

lhnz 2 years ago | | |

This is probably correct but I'd prefer that family don't read the conversations I've had, as even if I'm not saying anything too private, it feels too intrusive (it'd be a bit like reading my inner thoughts).

sholladay 2 years ago | | |

As a general rule, I don’t share account access. I can count on one hand the number of times I’ve made an exception to that rule and it was always for something relatively benign like Spotify. Privacy isn’t the only reason to avoid sharing, either.

I don’t even like that when my family picks up the remote, Apple TV assumes it’s me using the TV. They watch something and mess up my Up Next history and recommendations. I wish it supported using a PIN. I’ve thought about getting rid of the remote to force everyone to use their phone as a remote, because then it detects who is using it and automatically switches accounts. But that means everyone has to have an iPhone and have their phone charged, etc. Getting rid of the remote just for my convenience seems too inconsiderate.

londons_explore 2 years ago | |

Can't you just share the login details?

selfportrait 2 years ago | | |

Sharing the same space and turning off/on the custom instructions is also very annoying.

teleforce 2 years ago | |

Agreed on the most powerful feature is the ability to search the web. This feature single-handedly makes ChatGPT a very potent Google search alternative but without the dreaded advertisements.

eru 2 years ago | |

Bing's version can search the web.

cyanydeez 2 years ago | |

I guarantee you they aren't leaving money on the table. they're running the same techno capitalist playbook.

they want you hooked on apps, API, etc, before the real costs are brought in. they likely should be charging anywhere from 50-100$ depending on hours

minimaxir 2 years ago |

A notable feature here is "no training on your business data or conversations" which really shouldn't have to be a feature. (requests using the ChatGPT API already aren't trained upon)

wildpeaks 2 years ago |

Note that "no training on your data" is only for Team and Enterprise: https://openai.com/chatgpt/pricing

lhl 2 years ago | |

You can make a privacy request for OpenAI to not train on your data here: https://privacy.openai.com/

Alternatively, you could also use your own UI/API token (API calls aren't trained on). Chatbot UI just got a major update released and has nice things like folders, and chat search: https://github.com/mckaywrigley/chatbot-ui

happytiger 2 years ago | | |

It should be opt out by default: not opt in.

dan_bez 2 years ago | |

no training on API as well. I integrated it with Telegram over a year ago. For convenience rather than for cost savings. Been paying $2 per month on average ever since. And "No training on your data" is included.

queueueue 2 years ago | |

The API is not used for training purposes either. https://openai.com/enterprise-privacy

londons_explore 2 years ago | |

I suspect that user data isn't really valuable for training from anyway - the data will be full of users lying to the bot to try to manipulate it.

But "we won't train from your data" is a powerful marketing line, and differentiator between classes of customer, even if they have no intention to train from the data of anyone.

ta988 2 years ago |

A major change is that you cannot opt out of having your conversations used for training unless you are using a team account, which is pretty costly for a single person.

tedsanders 2 years ago | |

According to this, you can still opt out of training, but you have to turn off history: https://help.openai.com/en/articles/7730893-data-controls-fa...

OJFord 2 years ago | | |

That's been true for at least a month, not new with (though it may have been in anticipation of) teams support.

emsign 2 years ago | | |

Sneaky buggers

ec109685 2 years ago | |

This link lets individuals opt out: https://privacy.openai.com/policies

dizzydes 2 years ago |

OpenAI understand their tech lead isn't a sustainable moat, so are going for network effects. Similar to Slack Connect (shared channels).

weatherlite 2 years ago | |

I heard the no moat theory before and I don't get it. The open source models are about a year or two behind the latest ChatGPT in terms of quality. That means companies will always be willing to pay a premium to use ChatGPT and not rely on open source. So even if/when Google and Apple (and perhaps Meta) catch up in terms of A.I quality, there's still so much money to be made for OpenAI. One interesting byproduct of late game capitalism like this is that as more and more jobs get destroyed due to A.I, so will subscriptions. So it might be a mixed bag in the end for the tech giants if there's no real economy to buy the products anymore, but we're a long way from there.

hackerlight 2 years ago | | |

I think no moat vs moat is a false dichotomy. They have a moat (better researchers and data) and are about to make it even better (network effects).

goatlover 2 years ago | | |

It was one researcher's opinion at a competing company, and everyone treated it as fact.

phillipcarter 2 years ago | |

Yeah, this is something I've been saying as well. Their true "moat" is their network of people who know and understand how to use their tech.

ttul 2 years ago | | |

It’s the “we will make this so easy for you that you never want to switch” moat. Definitely akin to Slack, which also has the integration glue to keep you on their platform. Even though there are many Slack alternatives now that are really great, most companies on Slack will opt to stay there rather than invest in migrating.

wand3r 2 years ago |

Adjacent question, leaving aside value proposition. Do companies pay for 1000 seats like this? I didn't realize Slack is $5 a user a month. Do they discount this for bulk, or are companies paying $5k/month, $60k/year? These subscriptions must really add up.

On All In, they discussed the leverage from AI tools and they probably also meant open source, but one of the companies just rolled their own instance of a big monthly SaaS product because it was such a big expense for the startup.

mikepurvis 2 years ago | |

I'm not really in the know, but I bet the enterprise discounts don't kick in until you're at the tens of thousands of users. In any case, $60k sounds like a lot as a top-line figure to some bean-counter, but all these sales pitches follow the same basic pattern:

- This is an essential, best-in-class tool. You wouldn't deny your employees a laptop or a free lunch, would you?

- $5/user/mo is a bargain compared to the hassle of building/hosting this yourself, punching holes in your firewall every time you need to receive a webhook, dealing with security and auth issues.

- $60k is half the cost of someone you don't need to hire on your in-house IT team. Does it make sense yet?
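The seat arithmetic in this subthread ($5 per user per month across 1,000 seats) can be checked with a short sketch. The function name `seat_cost` is hypothetical and only for illustration; the figures are the list prices quoted above, with no bulk discount assumed.

```python
def seat_cost(seats, price_per_seat_month):
    """Return (monthly, yearly) spend for a per-seat subscription.

    Assumes flat list pricing with no negotiated discount.
    """
    monthly = seats * price_per_seat_month
    yearly = monthly * 12
    return monthly, yearly


# Figures quoted in the thread: 1,000 seats at $5 per user per month.
monthly, yearly = seat_cost(1000, 5)
print(f"monthly: ${monthly:,}  yearly: ${yearly:,}")
```

Any negotiated enterprise discount would simply scale `price_per_seat_month` down before the multiplication.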

rrr_oh_man 2 years ago | | |

> I bet the enterprise discounts don't kick in until you're at the tens of thousands of users

I'll take that bet ;) Not really sure about OpenAI, but you can absolutely negotiate with almost any company.

marpstar 2 years ago | |

This is why the price of "Enterprise" level of SaaS is always "Contact Us". Contract deals (i.e. "lock-in") are negotiated behind the scenes.

ren_engineer 2 years ago | |

you'd be amazed at how many startups waste hundreds of thousands (and millions) of dollars on buying seats for tools that barely anybody uses. Interest rate increases have made VC startups get a little smarter, but a few years ago it was really bad. Similar to how tons of startups burn huge amounts of money on AWS due to laziness

Aeolun 2 years ago | |

Yeah, companies really do. Once a year our company gets a really large bill (15k users, several services).

The thing is those same people need to be paid, and that’s a much (100x) larger bill, so the extra amount doesn’t really signify.

reallymental 2 years ago |

Ok, so there are now 2 tiers where they don't use our data to train the model?

The higher bandwidth is to clearly entice new customers, but the question remains, what happens to the old ChatGPT Plus users? Do their quotas get eaten up by these new teams?

yawnxyz 2 years ago | |

Looks like the $20/month PLUS plan DOES use your data to train the model now... (they seem to have removed that "feature" from the list in the side-by-side comparison)

Metricon 2 years ago | | |

Currently, if you disable chat history, you'll see this message:

Chat History is off for this browser. When history is turned off, new chats on this browser won't appear in your history on any of your devices, be used to train our models, or stored for longer than 30 days. This setting does not sync across browsers or devices.

obmelvin 2 years ago | | |

AFAIK, Plus has always trained on your conversation data. Enterprise and the API do not.

tempestn 2 years ago | | |

There used to be a form you could submit asking them not to train on your data. Absent some communication to the contrary I would hope that continues to be respected.

castles 2 years ago | |

It's not super obvious, but even with Plus you can opt-out of training.

Aside: If you can see other colleagues' interactions with the custom/private GPTs, it could be quite an efficient way to share knowledge, especially for people in disparate time zones.

reaperman 2 years ago | |

> what happens to the old ChatGPT Plus users? Do their quotas get eaten up by these new teams?

This is probably run on Microsoft servers (Azure, basically), not OpenAI servers, so it shouldn't directly compete for capacity. This is more of a "the pie got bigger" situation.

ashu1461 2 years ago |

I can see some good use cases:

- A custom gpt just trained on your code base can help you write test cases in your desired syntax.

- A custom gpt trained on internal PRDs can help brainstorm better on the next set of features.

Hoping to see something good come out of this

ankit219 2 years ago | |

This version of teams does not do that. You can hook it up by creating Custom GPTs and add some amount of docs to a specific GPT for retrieval, but you cannot connect an entire codebase to ChatGPT to get answers. Github[1] had introduced the feature you are talking about a year or so ago. Not sure if people are using it.

Use cases I see are common ones - basic usage of ChatGPT but admin can control access. Provides ability for companies to bill directly instead of reimbursements, and have more control over it. HR docs and policies can be a separate GPT. Though nothing which requires multi level access control.

[1]: https://githubnext.com/projects/copilot-view/

ashu1461 2 years ago | | |

Github's feature is under private beta right now. I feel that it will be impactful.

UI components can be generated as per your UI guidelines, same for tests. Hoping for good things

realusername 2 years ago |

I'm not too surprised by the move, it's a classic segmentation strategy, but I was surprised how poorly the example screenshots they gave reflect on the product.

You have one non-actionable marketing answer, a growth graph created without axes (what are people going to do with that?) and a Python file which would be easier just to run to get the error.

That kind of reinforces my belief that those AI tools aren't without their learning curves despite being in plain English.

VincentEvans 2 years ago |

Here’s an idea - ChatGPT app for Apple CarPlay. Right now while driving I often do “hey siri” - but instead of carrying on a conversation where I can ask clarifying questions, I am mostly greeted with “i cannot show you this information while driving”, because rather than summarizing the answer, Siri tries to show me some website link.

bix6 2 years ago |

“No training on your business data or conversations”

Does this mean they will still use your data for other non-training purposes?

hanspeter 2 years ago | |

Yes. They will use your data as input to the GPT model to deliver the response you have requested.

bix6 2 years ago | | |

Appreciate the funny response but that is obviously not the intention of my question

thinkingemote 2 years ago | |

maybe we will see "only human eyes can see your data" vs "no automated tools can see your data" in the future

mbesto 2 years ago | |

I mean, how else can you actually get a result without using your actual data...?

hereme888 2 years ago |

100 messages / 3 hrs, with a 32k context window. That's really cost effective and efficient for my use case!

Does anyone know if this applies to voice conversations? This is me while I'm driving: upload big PDF -> talk to GPT: "Ok, read to me the study/book/article word for word."

Good job OpenAI.

jarcoal 2 years ago | |

I'm confused -- wasn't ChatGPT upgraded to 128k tokens at their last release? Or was that just the API?

tedsanders 2 years ago | | |

Just the API.

codingdave 2 years ago | |

Why would you need AI to read a document word-for-word? That can be done already in various tools without needing to go through ChatGPT.

hereme888 2 years ago | | |

Could you direct me to a tool that is privacy preserving in Android (I consider my OpenAI privacy-friendly), and has such a quality of speech?

sagarpatil 2 years ago | |

GPT-4-1106-preview aka turbo has 128k tokens. Are they saying GPT-4 (0613) with 32k is better than GPT-4 Turbo?

TotoHorner 2 years ago | |

> 100 messages / 3 hrs

Sorry where do you see that? I only see "higher usage limits"?

hereme888 2 years ago | | |

https://help.openai.com/en/articles/8801707-what-is-the-mess...

That article doesn't say 100.

100 is what I read in the openai forums earlier today.

Roritharr 2 years ago |

The way they purposefully made the Enterprise Plan so much better than the Teams plan is genius. The pressure on Enterprises to "just do the right thing" is pretty heavy here; I'd bet this will make them more than a billion before the year is over.

Terretta 2 years ago | |

It's not better. It starts with "Call Us" so no matter what it includes, it's worse.

brcmthrowaway 2 years ago |

Typical VC filler, this is a sad day for Open AI (space emphasis)

alvis 2 years ago |

I think Team is a good addition that strengthens the product vision. I just hope it can connect to Notion so that we don't need to re-import all the data.

oflordal 2 years ago |

Did anyone evaluate this compared to using API access through an external GUI (e.g. continue.dev)? For software dev, did the cost end up higher? I am thinking this can be more convenient (and I suppose engineers can more easily use it outside work as a perk). Given that practical use across a team will vary, you get a lower price when using the API and perhaps additional opportunity for scripted use.

philip1209 2 years ago |

RIP to all the startups this just killed.

CamperBob2 2 years ago | |

Not your land, not your farm.

econner 2 years ago |

Our team has an Enterprise account, but individuals cannot access GPT-4 through the chat.openai.com interface. With teams, do individuals get access to GPT-4 through that interface? Is our account just broken somehow?

It seems odd we have enterprise but cannot access GPT-4 through the main ChatGPT interface.

athyuttamre 2 years ago | |

Do you have a ChatGPT Enterprise subscription purchased via our sales team? Or are you an API customer?

The former should have GPT-4 access; if not, that’s a bug, and I can look into it if you email me at atty@openai.com.

The API and ChatGPT are separate products, and usage or credits purchased for the API do not provide paid ChatGPT access.

hospitalJail 2 years ago |

I'd love if I could use both my users at the same time to ask 2x questions.

My wife uses chatgpt only a few times a day.

I guess I need to 2x my browsers. I don't think this would work on the phone because I believe I need my browser open for chatgpt to continue its computations.

msmenardi 2 years ago |

they don't want you to be able to communicate with your teammates without their knowledge

singularity2001 2 years ago |

Also part of the announcement:

The GPT store

https://news.ycombinator.com/item?id=38941158

https://chat.openai.com/gpts

bob1029 2 years ago |

I think assistants / agents are going to be the big thing this year.

I was working on something at the end November that was proposing competent PRs based upon request for work in a GH issue. I was about halfway through the first iteration of a prompt role that can review, approve and merge these PRs. End goal being a fully autonomous software factory wherein the humans simply communicate via GH issue threads. Will probably be back on this project by mid February or so. Really looking forward to it.

Bigger, more useful context is all I think I really want at this point. The other primitives can be built pretty quickly on top of next token prediction once you know the recipes.

thinkmassive 2 years ago |

Pricing:

$25 per user/month billed annually

$30 per user/month billed monthly
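To make the two billing options concrete, here is a minimal sketch that compares yearly cost per user. The Team prices are the ones listed above; the $20/month Plus price is taken from other comments in this thread, and the names `PLANS` and `yearly_cost` are hypothetical.

```python
# Monthly price per user in USD. Team prices are from the comment above;
# the Plus price is the $20/month figure mentioned elsewhere in the thread.
PLANS = {
    "plus": 20,
    "team_billed_annually": 25,
    "team_billed_monthly": 30,
}


def yearly_cost(plan, users=1):
    """Yearly cost in USD for `users` seats on the given plan."""
    return PLANS[plan] * 12 * users


for plan in PLANS:
    print(f"{plan}: ${yearly_cost(plan)} per user per year")

# Yearly saving per user from choosing annual billing over monthly billing.
saving = yearly_cost("team_billed_monthly") - yearly_cost("team_billed_annually")
print(f"annual billing saves ${saving} per user per year")
```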

ed_mercer 2 years ago | |

Why would they make it more expensive than individual plans?

transcriptase 2 years ago | | |

It’s marked up so they can offer a “special” discount on the sales call, and the customer can report back to their superiors that they “negotiated” a better deal and saved their company $X.

CaveTech 2 years ago | | |

ENTERPRISE baby. Business tiers are almost universally more expensive than individual tiers.

ukuina 2 years ago | | |

The delta is the price for disallowing training on chats while retaining all functionality of the web interface and app (e.g., Voice chat).

phantomathkg 2 years ago | | |

https://openai.com/chatgpt/pricing It is very clear on the highlight.

* Higher message cap.

* Create and Share GPTs within workspace.

* Admin console.

* No training.

nickthegreek 2 years ago | | |

I bet they will prioritize this traffic as well.

dataking 2 years ago | | |

no training on the data; more requests per hour.

happytiger 2 years ago |

So basically only companies don’t get spied on now? Even paid accounts are subject to data collection by default?

DelightOne 2 years ago |

Sounds like an upgraded Plus with privacy, so 30$ for additional privacy compared to Plus.

joshspankit 2 years ago |

Are there any new OpenAI opt in links that we might be missing?

Last one I remember was OpenAI GPT-4 API

saliagato 2 years ago | |

ashot 2 years ago |

no collaboration though, for actual collaborative team spaces give vello.ai a try

rogerthis 2 years ago |

The pattern in the footer of Open AI pages is very annoying, unintelligent.

kannangce 2 years ago |

I thought teams would be cheaper than individual.

anonylizard 2 years ago | |

Have you ever seen an enterprise plan cheaper than the individual plans (Which are often free)?

Normal software is priced to squeeze out as much money as possible: enterprises can afford more, so they are charged more. Individuals are highly price sensitive, so their plans have to be very cheap.

GenAI is quite different in that it's not zero marginal cost; the marginal costs are probably at least 50% of the price. So the price difference between enterprise and individual plans will be far smaller than usual, due to the common cost base.

callalex 2 years ago | | |

No, per-seat cost for these kinds of things is ALWAYS cheaper than retail. However it does require you to set up a meeting with a sales rep who will then work tirelessly to expand the number of services you use and require longer commitments etc. With “enterprise” pricing, the sticker price is just the opening number in a negotiation, and basic theory tells us that the opening number must be large since it sets a ceiling.

The_Colonel 2 years ago | | |

There are many products which cater to both individual users and enterprises, and these will often charge individual users more.

WinRAR is 30 EUR per user when buying a single license, 9 EUR when buying 100 licenses.

ehPReth 2 years ago |

don’t even get the option to force SSO unless you “contact sales” for the enterprise tier from what I can see :/

sgammon 2 years ago |

Woah. Bold move.

mvkel 2 years ago |

CEO will be the first job that AI replaces

benreesman 2 years ago |

I’ve got my stuff rigged to hit mixtral-8x7, and dolphin locally, and 3.5-turbo, and the 4-series preview all with easy comparison in emacs and stuff, and in fairness the 4.5-preview is starting to show some edge on 8x7 that had been a toss-up even two weeks ago. I’m still on the mistral-medium waiting list.

Until I realized Perplexity will give you a decent amount of Mistral Medium for free through their partnership.

Who is sama kidding they’re still leading here? Mistral Medium destroys the 4.5 preview. And Perplexity wouldn’t be giving it away in any quantity if it had a cost structure like 4.5, Mistral hasn’t raised enough.

Speculation is risky but fuck it: Mistral is the new “RenTech of AI”, DPO and Alibi and sliding window and modern mixtures are well-understood so the money is in the lag between some new edge and TheBloke having it quantized for a Mac Mini or 4070 Super, and the enterprise didn’t love the weird structure, remembers how much fun it was to be over a barrel to MSFT, and can afford to dabble until it’s affordable and operable on-premise.

“Hate to see you go, love to watch you leave”.

EmilStenstrom 2 years ago | |

Here's a glossary to understand this post:

- mixtral-8x7 or 8x7: Open source model by Mistral AI.

- Dolphin: An uncensored version of the mistral model

- 3.5-turbo: GPT-3.5 Turbo, the cheapest API from OpenAI

- 4-series preview OR "4.5 preview": GPT-4 Turbo, the most capable API from OpenAI

- mistral-medium: A new model by Mistral AI that they are only serving through their API. It's in private beta and there's a waiting list to access it.

- Perplexity: A new search engine that is challenging Google by applying LLM to search

- Sama: Sam Altman, CEO of OpenAI

- RenTech: Renaissance Technologies, a secretive hedge fund known for delivering impressive returns improving on the work of others

- DPO: Direct Preference Optimization. It is a technique that leverages AI feedback to optimize the performance of smaller, open-source models like Zephyr-7B.

- Alibi: a Python library that provides tools for machine learning model inspection and interpretation. It can be used to explain the predictions of any black-box model, including LLMs.

- Sliding window: a type of attention mechanism introduced by Mistral-7B. It is used to support longer sequences in LLMs.

- Modern mixtures: The process of using multiple models together, like "mixtral" is a mixture of several mistral models.

- TheBloke: Open source developer that is very quick at quantizing all new models that come out

- Quantize: Decreasing memory requirements of a new model by decreasing the precision of weights, typically with just minor performance degradation.

- 4070 Super: NVIDIA 4070 Super, new graphics card announced just a week ago

- MSFT: Microsoft
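The "Quantize" entry in the glossary above can be illustrated with a minimal sketch of symmetric 8-bit quantization in plain Python. This is only an illustration of the general idea (store weights as small integers plus one scale factor); it is not the scheme any particular model release uses, and the function names are hypothetical.

```python
def quantize_int8(weights):
    """Map float weights to integers in [-127, 127] plus a single scale factor."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the int8 range.
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float weights from the integers and the scale."""
    return [v * scale for v in q]


weights = [0.5, -1.27, 0.03, 1.0]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

# Each restored weight is within half a quantization step of the original.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(q, scale, max_error)
```

Each weight now needs one byte instead of four (for 32-bit floats), at the cost of a rounding error of at most half a step, which is the "minor performance degradation" trade-off the glossary mentions.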

vincentrolfs 2 years ago | | |

I asked ChatGPT to rewrite the original post using your glossary, which worked well:

I've set up my system to use several AI models: the open-source Mixtral-8x7, Dolphin (an uncensored version of Mixtral), GPT-3.5 Turbo (a cost-effective option from OpenAI), and the latest GPT-4 Turbo from OpenAI. I can easily compare their performances in Emacs. Lately, I've noticed that GPT-4 Turbo is starting to outperform Mixtral-8x7, which wasn't the case until recently. However, I'm still waiting for access to Mistral-Medium, a new, more exclusive AI model by Mistral AI.

I just found out that Perplexity, a new search engine competing with Google, is offering free access to Mistral Medium through their partnership. This makes me question Sam Altman, the CEO of OpenAI, and his claims about their technology. Mistral Medium seems superior to GPT-4 Turbo, and if it were expensive to run, Perplexity wouldn't be giving it away.

I'm guessing that Mistral AI could become the next Renaissance Technologies (a hedge fund known for its innovative strategies) of the AI world. Techniques like Direct Preference Optimization, which improves smaller models, along with other advancements like the Alibi Python library for understanding AI models, sliding windows for longer text sequences, and combining multiple models, are now well understood. The real opportunity lies in quickly adapting these new technologies before they become mainstream and affordable.

Big companies are cautious about adopting these new structures, remembering their dependence on Microsoft in the past. They're willing to experiment with AI until it becomes both affordable and easy to use in-house.

It's sad to see the old technology go, but exciting to see the new advancements take its place.

neals 2 years ago | | |

Crazy, your post feels like downloading martial arts in the Matrix. I read the parent, didn't get a thing and thought the guy was on substances. Read yours. Read the parent again. I speak AI now! I'm going to use this new power to raise billions!

benreesman 2 years ago | | |

I'm clearly spending far too much time tuning/training/using these things if a glossary to make my post comprehensible to HN is longer than my remark: thank you for correcting my error in dragging this sub-sub-sub-field into a thread of general interest.

azeirah 2 years ago | | |

That's an impressive list of jargon whaha

Love how deep the rabbithole has gone in just a year. I am unfortunately in the camp of understanding the post without needing a glossary. I should go outside more :|

Smerity 2 years ago | | |

I think you've done a great explanation expansion except I believe it's ALiBi[1] ("Attention with Linear Biases Enables Input Length Extrapolation"), a method of positional encoding (i.e. telling the Transformer model how much to weight a distant token when computing the current output token). This has been used on various other LLMs[2].

[1]: https://arxiv.org/abs/2108.12409

[2]: n.b. Ofir Press is co-creator of ALiBi https://twitter.com/OfirPress/status/1654538361447522305
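
For readers who want the positional-bias idea above in concrete form, here is a minimal sketch in plain Python. It assumes the simple case from the ALiBi paper where the number of heads is a power of two; the function names (`alibi_slopes`, `alibi_bias`, `softmax`) are made up for illustration and do not come from any library. Each head gets a fixed slope, and the attention score for a key token is penalized in proportion to how far it sits behind the query token.

```python
import math

def alibi_slopes(num_heads):
    """Per-head slopes: a geometric sequence starting at 2^(-8/num_heads).
    Assumes num_heads is a power of two."""
    start = 2.0 ** (-8.0 / num_heads)
    return [start ** (h + 1) for h in range(num_heads)]

def alibi_bias(seq_len, slope):
    """Causal bias matrix added to raw attention scores before softmax.
    bias[i][j] = -slope * (i - j) for past/current tokens, -inf for future ones."""
    return [
        [-slope * (i - j) if j <= i else -math.inf for j in range(seq_len)]
        for i in range(seq_len)
    ]

def softmax(row):
    """Numerically stable softmax over one row of scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

slopes = alibi_slopes(8)          # one slope per attention head
bias = alibi_bias(4, slopes[0])   # bias for the first head, 4 tokens
# If every raw attention score were equal (zero here), the bias alone
# decides the weights: nearer tokens get more attention than distant ones.
weights = softmax(bias[3])
print(weights)
```

Because the penalty is a simple linear function of distance rather than a learned embedding per position, the same rule applies unchanged to sequences longer than those seen in training, which is the extrapolation property the paper title refers to.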

spuz 2 years ago | | |

As someone who follows AI pretty closely, this was unbelievably helpful in understanding the parent post. It's crazy how much there is to keep on top of if you don't want to fall behind everything that is going on in AI at the moment.

pandemic_region 2 years ago | | |

Did you just paste that into an LLM and ask it to create a glossary? :-P

(but seriously: Thanks !)

rrr_oh_man 2 years ago | | |

I love you, Emil

hmottestad 2 years ago | | |

Thanks for this. I was initially wondering what this new GPT 4.5 model was and if I had somehow missed out on something big.

Demiurge 2 years ago | |

I have 20 years of software development experience, and I couldn’t understand anything you said. Is there a dictionary for this new lingo, or am I just too mid?

kolinko 2 years ago | | |

He speaks very unclearly: instead of saying GPT-4-turbo he says 4.5 preview. 4.5 is an invention of his.

Also mixtral medium - no idea of what he means by that.

Not to mention a claim that mixtral is as good as gpt-4. It’s on the quality of gpt3.5 at best, which is still amazing for an open source model, but a year behind openai

benreesman 2 years ago | | |

On reflection this thread is pretty clearly of general interest and my comment was more jargon than language; I hang out in ML zones too much.

For a broad introduction to the field Karpathy's YouTube series is about as good as it gets.

If you've got a pretty solid grasp of attention architectures and want a lively overview of stuff that's gone from secret to a huge deal recently I like this treatment as a light but pretty detailed podcast-type format: https://arize.com/blog/mistral-ai

vincnetas 2 years ago | | |

Now you know how your girlfriend feels when she hears you speak with other software people :) Excuse my assumptions if they are incorrect. I'm making projections from my own point of view.

andersa 2 years ago | | |

Just follow https://www.reddit.com/r/localllama to keep up to date on this stuff

appplication 2 years ago | | |

Yeah that was completely incoherent to me as well.

transitus 2 years ago | | |

We're all too mid. Luckily, these days we hoomans have AIs to help us understand other hoomans. Here are the GPT-4-1106-preview and Perplexity.ai versions trying to shed some light on what was being said. https://pastebin.com/JuxfdrLg

Hilariously neither knows who sama is (Sam Altman, the Drama King of OpenAI), nor do they recognize when they themselves are being discussed.

Reading the responses in full also gives you a glimpse on specific merits or weaknesses of these systems, namely how up to date is their knowledge and lingo, explaining capabilities, and ability to see through multiple layers of referencing. Also showcases whether the AIs are willing to venture guessing to piece together some possible interpretation for hoomans to think about.

karmasimida 2 years ago | | |

He is all over the place, mixing tech specifics with unproven models.

Basically, he said he is happy with Mistral 8x7B and thinks it is on par with or better than OpenAI's closed source model.

walteweiss 2 years ago | | |

Oh thank you, I was reading and none of that made any sense to me. I thought it could be a presentation of some dumb AI output. Now I see I’m not alone.

ringofchaos 2 years ago | | |

It's a specific lingo that evolved over the last two years with the rise of LLMs. Those who have been following the development of LLMs would understand it.

imperialdrive 2 years ago | | |

Just had to say that the original comment, and then yours right after, is a great combo. Laughed my ass off :)

swyx 2 years ago | | |

respectfully, 20 yrs of software dev experience doesn't entitle you to understand the last 2 months of AI if you didn't spend the effort to keep up. jargon happens, its not your fault but also people need to communicate thoughts concisely given a base of knowledge. its ok to ask of course but the rest of us who have been keeping up can parse this well enough (even though I disagree with some of the assertions)

bmikaili 2 years ago | | |

They are referring to LLM models. It's not about how much software dev experience you have

sevagh 2 years ago | | |

Half LLM, half boomer

icelancer 2 years ago | |

> Mistral Medium destroys the 4.5 preview.

On what metrics? LMSys shows it does well but 4-Turbo is still leading the field by a wide margin.

I am using 8x-7b internally for a lot of things and Mistral-7b fine-tunes for other specific applications. They're both excellent. But neither can touch GPT-4-turbo (preview) for wide-ranging needs or the strongest reasoning requirements.

https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboar...

EDIT: Neither does mistral-medium, which I didn't discuss, but is in the leaderboard link.

benreesman 2 years ago | | |

Keep in mind that modern quantitative approaches to LLM evaluation have been effectively co-designed with the rise of OpenAI, and folks like Ravenwolf routinely disagree with the leaderboards.

There's also very little if any credible literature on what constitutes statistically significant on MMLU or whatever. There's such a massive vested interest from so many parties (the YC ecosystem is invested in Sam, MSFT is invested in OpenAI, the US is invested in not-France, a bunch of academics are invested in GPT-is-borderline-AGI, Yud is either a Time Magazine cover author or a Harry Potter fanfic guy, etc.) in seeing GPT-4.5 at the top of those rankings and taking the bold one at < 10% lift as state of the art that I think everyone should just use a bunch of them and optimize per use case.
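
As a rough illustration of why small leaderboard gaps are hard to interpret, here is a back-of-the-envelope sketch. It assumes each benchmark question is an independent pass/fail trial and treats the two models as independent samples, which ignores that both are scored on the same questions; the accuracies and question counts are hypothetical, not real benchmark results, and the function names are made up for this example.

```python
import math

def accuracy_std_error(accuracy, num_questions):
    """Standard error of an accuracy score under a simple binomial model."""
    return math.sqrt(accuracy * (1.0 - accuracy) / num_questions)

def difference_z_score(acc_a, acc_b, num_questions):
    """z-score for the gap between two models, treated as independent samples."""
    se_a = accuracy_std_error(acc_a, num_questions)
    se_b = accuracy_std_error(acc_b, num_questions)
    return (acc_a - acc_b) / math.sqrt(se_a ** 2 + se_b ** 2)

# Hypothetical scores: 80% vs 78% accuracy on benchmarks of two sizes.
z_small = difference_z_score(0.80, 0.78, 1_000)
z_large = difference_z_score(0.80, 0.78, 10_000)
print(f"1,000 questions:  z = {z_small:.2f}")
print(f"10,000 questions: z = {z_large:.2f}")
```

Under these assumptions the same two-point gap falls below the conventional 1.96 threshold on 1,000 questions and clears it on 10,000, so whether a lead "counts" depends heavily on benchmark size before any question of benchmark design even comes up.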

I have my own biases as well and freely admit that I love to see OpenAI stumble (no I didn't apply to work there, yes I know knuckleheads who go on about the fact they do).

And once you factor in "mixtral is aligned to the demands of the user and GPT balks at using profanity while happily taking sides on things Ilya has double-spoken on", even e.g. MMLU is nowhere near the whole picture.

It's easy and cheap to just try both these days, don't take my word for which one is better.

leo150 2 years ago | | |

It just feels like “what LLM is better” becomes new “what GPU is better” type of talk. It’s great to find a clear winner, but at the end the gap between the leaders isn’t an order of magnitude.

asenna 2 years ago | |

Dolphin-mixtral is incredible for the size that it is. But I'm curious, have you tried Goliath-120b or the new `Mixtral_34Bx2_MoE_60B` (it's named Mixtral but the base is actually Yi).

Goliath is too big for my system but Mixtral_34Bx2_MoE_60B[1] is giving me some really good results.

PSA to anyone that does not understand what we're talking about: I was new to all of this until two weeks ago as well. If you want to get up to speed with the incredible innovation and home-tinkering happening with LLMs, you have to check out https://www.reddit.com/r/LocalLLaMA/

I believe we should be at GPT4 levels of intelligence locally sometime later this year (Possibly with the release of Llama3 or Mistral Medium open-model).

[1] - https://huggingface.co/TheBloke/Mixtral_34Bx2_MoE_60B-GGUF

dudeinjapan 2 years ago | |

Speculative musings beckon, and we dare to embrace them. The crux of the matter appears to be the chasm that separates novel advancements from the moment they are quantified for mainstream consumption. Retaining vivid memories of past entanglements with industry titans, circumspectly explore and exploit these innovations until they become both affordable and practicable for on-premise utilization, finally unveiling competitive prowess. The overarching question looms large. Perhaps, Mistral has not yet amassed the financial resources commensurate with such largesse.

"My hips don't lie."

benreesman 2 years ago | | |

https://gist.github.com/b7r6/fde6fb3be9a752a989054e62905307f...

rvba 2 years ago | |

Was this generated by some AI? Is it a parody?

benreesman 2 years ago | | |

I've made similar apologies upthread but I'm passionate about this being an inclusive conversation and so I'm trying to respond to everyone who I confused with all the jargon.

The trouble with the jargon is that it obfuscates to a high degree even by the standards of the software space, and in a field where the impact on people's daily lives is at the high end of the range, even by the standards of the software space.

HN routinely front-pages stuff where the math and CS involved is much less accessible, but for understandable reasons a somewhat tone-deaf comment like mine is disproportionately disruptive: people know this stuff matters to them either now or soon, and it's moving as quickly as anything does, and it's graduate-level material.

If you have concrete questions about what probably looks like word salad I'll do my best to clarify (without the aid of an LLM).

coldtea 2 years ago | |

Not sure what all the fuss is about the incomprehensibility of this. It's a densely packed comment, information wise, and expects familiarity with the field, but there's nothing really that obscure about it.

I might not know half of the references like "sama" or "TheBloke", but I could understand the context of them all. Like:

"the lag between some new edge and TheBloke having it quantized for a Mac Mini or 4070 Super,"

Not sure who TheBloke is, but he obviously means "between some new (cutting) edge AI model, and some person scaling it to run on smaller computers with less memory".

Similarly, not sure who Perplexity is, but "Until I realized Perplexity will give you a decent amount of Mistral Medium for free through their partnership" basically spells out that they're a service provider of some kind, that they have partnered with Mistral AI, and you get to use the Mistral Medium model through opening a free account on Perplexity.

I mean, duh!

Prcmaker 2 years ago | |

I'm still waiting for the AI encabulator.

LeonM 2 years ago | | |

Had a good laugh about your comment, then realized that this is _exactly_ what AI would be really good at...

Basically let an AI hallucinate on some technical subject. It would make a great script for a new encabulator video.

eurekin 2 years ago | |

Care to share, what are you using it for?

I'm curious because I'm gathering some use cases that I could share internally in the company to provide better education on what LLMs do and how they work.

hermiod 2 years ago | |

Any chance you could post some comparisons between Mistral medium and gpt-4 turbo? I'm curious where you think it's more impressive, I hadn't spent the time to evaluate it yet.

icelancer 2 years ago | | |

Go to the Arena (side-by-side) tab on LMsys and you can try it yourself!

https://chat.lmsys.org/

It's a great tool they make available.

pyinstallwoes 2 years ago | |

Can you share some examples of how you are using it? Mixtral that is? What's your setup? What's your flow/workflow?

benreesman 2 years ago | | |

I screenshotted my emacs session upthread in a bit of a cheeky "AI-talking-about-AI" joke: https://imgur.com/WDrqxsz.

While I heavily rely on `emacs` as my primary interface to all this stuff, I'm slowly-but-surely working on a curated and opinionated collection of bindings and tools and themes and shit for all the major hacker tools (VSCode, `nvim`, even to a degree the JetBrains ecosystem). This is all broadly part of a project I'm calling `hyper-modern` which will be MIT if I get to a release candidate at all.

I have a `gRPC` service that wraps the outstanding work by the "`ggerganov` crew" loosely patterned on the sharded model-server architectures we used at FB/IG and mercilessly exploiting the really generous free-plan offered by the `buf.build` people (seriously, check out the `buf.build` people) in an effort to give hackers the best tools in a truly modern workflow.

It's also an opportunity to surface some of the outstanding models that seem to have sunk without a trace (top of mind would be Segment Anything out of Meta and StyleTTS which obsoletes a bunch of well-funded companies) in a curated collection of hacker-oriented capabilities that aren't clumsy bullshit like co-pilot.

Right now it's a name and a few thousand lines of code too rough to publish, but if I get it to a credible state the domain is `https://hyper-modern.ai` and the code will be MIT at `https://github.com/hyper-modern-ai/`.

no_streams 2 years ago | |

I'm curious about your workflow including all of these, is it only for your curiosity? Do you switch between them for specific tasks, or even run them in parallel for some purpose?

Also, is anyone aware of a service that supplies API endpoints for dolphin? I'd love to experiment with it, but running locally exceeds my budget.

nopinsight 2 years ago | |

Curious that you mentioned "4.5-preview". What do you mean there?

To my knowledge, and I searched to confirm, GPT-4.5 is not yet released. There were some rumors and a link to ChatGPT's answer about GPT-4.5 (could also be a hallucination) but Sam tweeted it was not true.

callalex 2 years ago | | |

They literally made it up.

EmilStenstrom 2 years ago | | |

They meant GPT-4 Turbo, which is an improvement over GPT-4.

pama 2 years ago | |

Thanks for the insights. What is your typical Emacs workflow for using and comparing the models?

benreesman 2 years ago | | |

I'm running custom stuff that I plan/hope to MIT soon, but `gptel` is killer and I've substantially plagiarized it feature-wise in my own dots. (I don't intend to release anything under a more permissive license than it was published under, merely that it sets the bar on a good interface and I plan to launch nothing less capable).

kidsil 2 years ago | |

I understand some of these words.

In all seriousness, are self hosted GPT alternatives really viable?

brcmthrowaway 2 years ago | |

If anyone understands this post you are worth a million dollars. Get that bag!

logicchains 2 years ago | |

>Alibi

Do you have a source on Mistral/Mixtral using that?

benreesman 2 years ago | | |

No, they could be using any of the variants of pointwise scalar trig-style embedding, one imagines it's at least a little custom to their particular training setup.

It was just an example of a modern positional encoding. I regret that I implied inside knowledge about that level of detail. They're doing something clever on scalar pointwise positional encoding but as for what who knows.

megablast 2 years ago | |

What a nonsensical statement

shrx 2 years ago |

Could a moderator change the "Teams" in the title to lowercase (as it is in the article)? Capitalizing Teams misleadingly implies it's regarding Microsoft's chat platform.

laborcontract 2 years ago |

At the end of the day I wonder what openai's endgame is here. They're starting to expand their business in a way that geometrically grows the size of the team, overlapping products that microsoft is offering, making the whole non-profit/capped-profit thing a head scratcher.

I guess you can argue this is just a marginal add-on to their existing ChatGPT product but I can imagine seeing them go full Salesforce/Oracle/enterprise behemoth here.

I would say I'm very pro AI development and pro Sam reinstating but I've been starting to shake my head a bit. Their mission and their ambition are wildly different.

martinky24 2 years ago | |

It’s pretty obvious that once they realized how much money was on the table, the “non-profit” aspirations and goals went out the window. The Altman saga from a few months ago painted this clearly.

engineer_22 2 years ago | | |

How much money is on the table?

toomuchtodo 2 years ago | |

I would assume the endgame is like Microsoft: to become an OS for your org. Knowledge management, human augmentation (code, emails, copilot all the things), data analytics, workflow automations, etc.

The mission changed when research ran into product market fit.

waynesonfire 2 years ago | |

I kinda see it differently. There are these incredible use-cases for what they can do with this technology but still requires massive R&D and politics. They're taking the path of least resistance with these features. They should spin off R&D and let another division handle this low-hanging fruit garbage. But, maybe this is just how a business cycle works. You get a bite and you milk it for what it's worth and let the next generation organization take it to the next level.

m3kw9 2 years ago | |

This is a way to create a moat. You think Zoom can survive open source with just bland features serving moms and pops? That’s how you get them to stay out of open source

wilg 2 years ago | |

> At the end of the day I wonder what openai's endgame is here

Sell AI products to fund making AI

ChatGTP 2 years ago | | |

At some point I think there is a conflict though, the more powerful the AI models, the more risk there is to their own business.

padjo 2 years ago |

The Engineering example is absolutely hilarious. Sure, I’m going to copy paste my code into an AI assistant to ask it about a bug that a linter would spot in realtime as I wrote the code.

callalex 2 years ago | |

I agree with you completely, but the target audience of people who will do such a thing have no clue what a linter, lexer, or parser are. Maybe even a compiler. And that audience is much larger than us folks at the ripe old age of 25+ even realize.

gumballindie 2 years ago |

The sooner we build a tool to filter out chatgpt generated garbage the better.

aleph_minus_one 2 years ago | |

> The sooner we build a tool to filter out chatgpt generated garbage the better.

The sooner we build a tool to filter out garbage the better.

FTFY

conception 2 years ago | |

We can’t filter out human generated garbage. Not sure how AI will be easier.

jes5199 2 years ago | | |

maybe we can ask the AI to filter it

asicsarecool 2 years ago | |

Maybe we extend UTF so characters can have an AI generated flag.

People could work around it but it might help

victor9000 2 years ago |

meh