The part of the entire "AI" - but not-artifical and not-intelligent - marketplace that I find amusingly curious is when any users expect the large model providers to take actions which are not 100% objectively for the purpose of resulting in more income from less computation resource.
Any action on the part of the model provider, which is not completely in the direction of more money for less work, is actually working against the interest of the provider, the shareholders, and various investors. Any such action would open the specific employees or parties taking that non-profit maximizing action to official review, reversal of action, and possible reduction in responsibility or termination of duties. That is completely within the job description of employment at any of the places, including Anthropic.
So, users, be informed, the model providers are not allowed to take user's best interest in to scope for making utility decisions.
The company will at it's own discretion; reduce customers' allowance of capability, reduce the computational cost for performing the same actions at a later date, reduce the precision and utility of results iteratively as the number of users increases overall demand. Model providers will showcase higher performance and precision to non-paying 'potential customers' and then nominally deliver increasingly reduced performance up-to the point of a specific and acceptable ratio of account cancellations.
This is specifically in order for the model providers to extract as much coin for as little work as possible: that is the purpose of the many profitability matrices and customer load balancing 'knobs' that are run using the very best models. It is a given and very reasonable to expect an explicit requirement from the top that the internally facing model capacity is NEVER degraded for the tasks of increasing revenue per unit of work; that is, to make as much as they can at any point in time, and make more at every point in time than they did before.
The responsible use of these models should be; use the best performance (i.e. the free tier) to assist in the development of infrastructure to run models which are actually under your control; either on in-house hardware, or on leased VPS instances with specific hardware and performance guarantees. Then use models under one's own control to develop stable and irrevocably performant workflows.
Any reliance on public facing subscription models for actual work is 100% guaranteed to deteriorate in functionality over time and waste money trying to maintain performance as measured by cost per token over time. That is the fact of the matter.
/<today's free lesson>