Not every little thing wants an LLM: A framework for evaluating when AI is sensible


Be a part of our every day and weekly newsletters for the most recent updates and unique content material on industry-leading AI protection. Be taught Extra


Query: What product ought to use machine studying (ML)?
Undertaking supervisor reply: Sure.

Jokes apart, the arrival of generative AI has upended our understanding of what use circumstances lend themselves finest to ML. Traditionally, we have now at all times leveraged ML for repeatable, predictive patterns in buyer experiences, however now, it’s doable to leverage a type of ML even with out a whole coaching dataset.

Nonetheless, the reply to the query “What buyer wants requires an AI answer?” nonetheless isn’t at all times “sure.” Giant language fashions (LLMs) can nonetheless be prohibitively costly for some, and as with all ML fashions, LLMs will not be at all times correct. There’ll at all times be use circumstances the place leveraging an ML implementation shouldn’t be the appropriate path ahead. How will we as AI challenge managers consider our clients’ wants for AI implementation?

The important thing concerns to assist make this determination embrace:

  1. The inputs and outputs required to satisfy your buyer’s wants: An enter is supplied by the shopper to your product and the output is supplied by your product. So, for a Spotify ML-generated playlist (an output), inputs might embrace buyer preferences, and ‘preferred’ songs, artists and music style.
  2. Mixtures of inputs and outputs: Buyer wants can differ based mostly on whether or not they need the identical or totally different output for a similar or totally different enter. The extra permutations and combos we have to replicate for inputs and outputs, at scale, the extra we have to flip to ML versus rule-based programs.
  3. Patterns in inputs and outputs: Patterns within the required combos of inputs or outputs aid you resolve what sort of ML mannequin you want to use for implementation. If there are patterns to the combos of inputs and outputs (like reviewing buyer anecdotes to derive a sentiment rating), think about supervised or semi-supervised ML fashions over LLMs as a result of they could be cheaper.
  4. Value and Precision: LLM calls will not be at all times low cost at scale and the outputs will not be at all times exact/actual, regardless of fine-tuning and immediate engineering. Typically, you’re higher off with supervised fashions for neural networks that may classify an enter utilizing a hard and fast set of labels, and even rules-based programs, as a substitute of utilizing an LLM.

I put collectively a fast desk under, summarizing the concerns above, to assist challenge managers consider their buyer wants and decide whether or not an ML implementation looks like the appropriate path ahead.

Sort of buyer wantInstanceML Implementation (Sure/No/Relies upon)Sort of ML Implementation
Repetitive duties the place a buyer wants the identical output for a similar enterAdd my electronic mail throughout numerous types on-lineNoMaking a rules-based system is greater than ample that can assist you together with your outputs
Repetitive duties the place a buyer wants totally different outputs for a similar enterThe client is in “discovery mode” and expects a brand new expertise after they take the identical motion (reminiscent of signing into an account):

— Generate a brand new art work per click on

StumbleUpon (keep in mind that?) discovering a brand new nook of the web by means of random search

Sure–Picture era LLMs

–Advice algorithms (collaborative filtering)

Repetitive duties the place a buyer wants the identical/comparable output for various inputs–Grading essays
–Producing themes from buyer suggestions
Relies uponIf the variety of enter and output combos are easy sufficient, a deterministic, rules-based system can nonetheless give you the results you want. 

Nevertheless, when you start having a number of combos of inputs and outputs as a result of a rules-based system can not scale successfully, think about leaning on:

–Classifiers
–Subject modelling

However provided that there are patterns to those inputs. 

If there are not any patterns in any respect, think about leveraging LLMs, however just for one-off eventualities (as LLMs will not be as exact as supervised fashions).

Repetitive duties the place a buyer wants totally different outputs for various inputs –Answering buyer assist questions
–Search
SureIt’s uncommon to come back throughout examples the place you possibly can present totally different outputs for various inputs at scale with out ML.

There are simply too many permutations for a rules-based implementation to scale successfully. Think about:

–LLMs with retrieval-augmented era (RAG)
–Resolution timber for merchandise reminiscent of search

Non-repetitive duties with totally different outputsAssessment of a lodge/restaurantSurePre-LLMs, this kind of situation was tough to perform with out fashions that have been educated for particular duties, reminiscent of:

–Recurrent neural networks (RNNs)
–Lengthy short-term reminiscence networks (LSTMs) for predicting the following phrase

LLMs are a terrific match for this kind of situation. 

The underside line: Don’t use a lightsaber when a easy pair of scissors might do the trick. Consider your buyer’s want utilizing the matrix above, taking into consideration the prices of implementation and the precision of the output, to construct correct, cost-effective merchandise at scale.

Sharanya Rao is a fintech group product supervisor. The views expressed on this article are these of the writer and never essentially these of their firm or group.


Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles