Wiki
FlickrLeetcode
  • 💞Artificial Intelligence
    • ⚙️Midjourney vs Stable Diffusion
    • ⚙️Creative QR Codes with ControlNet
      • ⚙️How to generate a QR Code
      • ⚙️Collect prompt
      • ⚙️clip skip
      • ⚙️AUTOMATIC1111
      • ⚙️Edge detection and human pose detection
    • ⚙️Stable Diffusion
    • ⚙️What is 'token' and 'turbo'?
    • ⚙️Today's learning--LangChain
      • ⚙️Prompt
    • ⚙️LLM Parameters Demystified
    • ⚙️What is Cohere playground?
    • ⚙️DALL-E
    • ⚙️How to use AI to learn something that you don't know?
    • ⚙️Diffusers
    • Boosting Algorithms in machine learning, part 1:AdaBoost
  • 💞GitHub
    • ✅How to add a issue of the code with GitHub?
    • ✅How to edit code?
    • ✅How to use GitHub Desktop
    • ✅How to review in GutHub?
  • 💞Lastest concepts for me
    • 🪅Pandas DataFrame
    • 🪅Choosing between loc and iloc
  • 💞Need to remember
    • 🔢An article for leetcode
    • 🉑two types of group work
    • 🍒What is hashtag#?
    • 🐝Week6_APD
    • 🦋API
    • 🎼BFF
  • 💞Python
    • 🐍argument & parameter
    • 🐍"{:.2f}"
    • 🐍Timeit in Python
    • 🐍Today's learning--Pylint
    • 🐍Split and Strip in Python
    • 🐍Getter and Setter in Python
    • 🐍"import json" in Python
    • 🐍Open CSV file in Python
    • 🐍print(f"An error occurred: {e}")
  • Page
  • 🪅command-line
  • 💞DataVisualization
    • 🪅How to choose plot type
  • 💞DataCleaning
    • 🪅Some basic code of data_cleaning
  • 💞Java
    • 🍡difference use between ArrayList and HashMap in Java
    • 🍡ArrayList & LinkedList
    • 🍡assertFalse(App.checkInputNumber(1,0))
      • 🍡HashSet
    • 🍡iterator
    • 🍡Java concept of assignment 1
    • 🍡Week6_Java
    • 🍡serializable
  • 💞Mark something that easily to forget
    • 🙉Mark something
    • 🙉How to edit cover picture from "Flickr" using "URL" in GitBook?
  • 💞VS Code
    • ✖️Install a project in VS Code
    • ✖️What should do after editing the code of one branch?
    • ✖️How to create a branch in VS code?
    • ✖️How to debug?
Powered by GitBook
On this page
  • What does tokenization have to do with the cost of running a model?
  • Azure OpenAI Service
  • Turbo
  • API
  • what is the difference between Java and JavaScript?
  • Node.js

Was this helpful?

Edit on GitHub
  1. Artificial Intelligence

What is 'token' and 'turbo'?

some concepts~

PreviousStable DiffusionNextToday's learning--LangChain

Last updated 1 year ago

Was this helpful?

In the context of LLM, tokens are the basic units of text or code that an LLM AI uses to process and generate language. Tokens can be characters, words, subwords, or other segments of text or code, depending on the chosen tokenization method or scheme. Tokenization is the process of splitting the input and output texts into smaller units that can be processed by the LLM AI models. Tokens are assigned numerical values or identifiers and are arranged in sequences or vectors, and are fed into or outputted from the model. The number of tokens parameter allows you to set a limit to how many tokens are generated. The natural limit to the number of tokens the model can produce varies depending on the model size, with smaller models going up to 1024 and larger models going up to 2048. It is generally recommended to generate in short bursts instead of one long burst to avoid the model going off in a direction you are not expecting.

What does tokenization have to do with the cost of running a model?

Tokenization affects the amount of data and the number of calculations that the model needs to process. The more tokens that the model has to deal with, the more memory and computational resources that the model consumes. Therefore, the cost of running an OpenAI or Azure OpenAI model depends on the tokenization method and the vocabulary size that the model uses, as well as the length and the complexity of the input and output texts. Based on the number of tokens used for interacting with a model and the different rates for different models, your costs can widely differ. For example, as of February 2023, the rate for using Davinci is $0.06 per 1,000 tokens, while the rate for using Ada is $0.0008 per 1,000 tokens. The rate also varies depending on the type of usage, such as playground, search, or engine. Therefore, tokenization is an important factor that influences the cost and the performance of running an OpenAI or Azure OpenAI model.

Azure OpenAI Service

It is a cloud-based service that provides REST API access to OpenAI's powerful language models, including the GPT-3, Codex, and Embeddings model seriers. The service also offers the new GPT-4 and ChatGPT(gpt-35-turbo) model series in preview. These models can be easily adapted to specific tasks, including content generation, summarization, semantic search, and natural language to code translation. Users can access the service through REST APIs, Python SDK, or the web-based interface in the Azure OpenAI Studio.

Turbo

Turbo is a team used in the context of OpenAI's language models, specially GPT-3.5 Turbo, which is an improved version of GPT-3.5 and GPT-3 Davinci. The GPT-3.5 Turbo model is OpenAI's latest and most advanced language model, which powers the popupar ChatGPT. It can accept a series of messages as input, unlike the previous version that only allowed a single text prompt. This capability unlocks some interesting features, such as the ability to generate more complex and meaningful responses. The GPT-3.5 Turbo model is part of the GPT-3 family of models that can understand and generate natural language or code.

API

Application Programming Interface. It is a set of rules and protocols that enables different software applications to communicate and interact with each other. APIs define the methods and data formats that can be used to access and manipulate the functionality and resources of a particular software system or service.

In simpler terms, an API acts as a bridge between different software components, allowing them to exchange information and perform actions.

what is the difference between Java and JavaScript?

Java and JavaScript are two different programming languages.

  • JavaScript is an object-oriented scripting language.

  • JavaScript code is run on a browser only.

  • JavaScript code are all in text.

  • JavaScript is a loosely typed language.

Node.js

It is an open-scorce, cross-platform runtime environment for building fast and scalable server-side and networking applications. It allows you to run JavaScript on the server. Node.js is built on Chrome's V8 Javascript engine and uses event-driven, non-blocking I/O architecture, which makes it efficient and suitable for real-time applications. Node.js is composed of Google's V8 JavaScript engine, the libUV platform abstraction layer, and a core library that is written in JavaScript. It is free and runs on various platforms, including Windows, Linux, Unix, and Mac OS X.

It can generate dynamic page content, create, open, read, write, delete, and close files on the server, collect form data, and add, delete, modify data in your database.

Node.js is also scalable and can handle a huge number of simultaneous connections with high throughput.

And it is used for a wide variety of applications, including real-time chats, video streaming sites, and online chatting

💞
⚙️
https://plainenglish.io/blog/beginners-guide-to-openai-s-gpt-3-5-turbo-model
https://www.ankursnewsletter.com/p/gpt-4-gpt-3-and-gpt-35-turbo-a-review