
[WIP] v1.0.0 Updates #71


Merged: 80 commits into main, Apr 1, 2024
Conversation

@orangetin (Member) commented Jan 31, 2024

To-Do:

  • Async client across all classes
  • Abstracted engine
  • Persistent session
  • Client class
  • Structured typing
  • Chat Completions support
  • Update CLI
  • Update price-estimate function
  • Update README and examples
  • Add unit tests
  • Add integration tests
  • Add pre-commit linter & update GitHub workflow
  • Replace requests/aiohttp with httpx
  • Allow strict pydantic typing
  • Add overloads for all API classes
  • Add timeout to header
  • Fix CLI comments and help hints
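
As a sketch of what "overloads for all API classes" with structured typing can look like, the stub below uses typing.overload so a type checker narrows the return type based on a stream flag. The response class and create function here are illustrative stand-ins, not the library's actual API:

```python
from typing import Iterator, Literal, Union, overload


# Hypothetical response type standing in for the library's pydantic models.
class ChatCompletionResponse:
    def __init__(self, text: str) -> None:
        self.text = text


@overload
def create(prompt: str, stream: Literal[False] = ...) -> ChatCompletionResponse: ...
@overload
def create(prompt: str, stream: Literal[True]) -> Iterator[ChatCompletionResponse]: ...


def create(
    prompt: str, stream: bool = False
) -> Union[ChatCompletionResponse, Iterator[ChatCompletionResponse]]:
    # Stub body: a real client would call the API here. When streaming,
    # yield one "chunk" per token; otherwise return a single response.
    if stream:
        return iter(ChatCompletionResponse(tok) for tok in prompt.split())
    return ChatCompletionResponse(prompt)
```

With this pattern, `create("hi")` type-checks as a single response while `create("hi", stream=True)` type-checks as an iterator, without a cast at the call site.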

Endpoints to support:

  • Completions
  • Chat Completions
  • Embeddings
  • Finetune
  • Files
  • Images
  • Models

Example usage:

import json
import os

import together

api_key = os.getenv("TOGETHER_API_KEY")

client = together.Together(api_key=api_key)

# Chat Completions
response = client.chat.completions.create(
    model="togethercomputer/llama-2-7b-chat",
    max_tokens=10,
    messages=[{"role": "user", "content": "hello there"}],
)
print(response.choices[0].message.content)

# Completions
response = client.completions.create(
    model="togethercomputer/llama-2-7b",
    max_tokens=10,
    prompt="hello there",
)
print(response.choices[0].text)

# Embeddings
response = client.embeddings.create(model="bert-base-uncased", input=["test"])
print(response.data[0].embedding)

# Fine Tuning
response = client.fine_tuning.create(
    training_file="file-6e432514-18e8-407d-b36e-ba904e4d4856",
    model="togethercomputer/llama-2-7b",
)
print(json.dumps(response.model_dump(), indent=4))

# Files
response = client.files.upload("unified_joke_explanations.jsonl")
print(json.dumps(response.model_dump(), indent=4))

# Images
response = client.images.generate(
    prompt="space robots",
    model="stabilityai/stable-diffusion-xl-base-1.0",
    steps=10,
    n=4,
)
print(response.data[0].b64_json)

# Models
response = client.models.list()
print(response[0].id)

Updated contribution style

Setting up pre-commit for dev:

poetry install --with quality,tests
pre-commit install
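
The hook set itself isn't shown in this thread; a minimal .pre-commit-config.yaml for a setup like this (the repos and pinned revs below are assumptions for illustration, not this repo's actual config) might look like:

```yaml
repos:
  - repo: https://github.com/psf/black
    rev: 24.3.0
    hooks:
      - id: black
  - repo: https://github.com/astral-sh/ruff-pre-commit
    rev: v0.3.4
    hooks:
      - id: ruff
```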


linear bot commented Jan 31, 2024

ENG-900 Support messages (chat endpoint) in together python library

Multiple customers are confused because passing prompt through the together Python library sends the raw prompt as-is, whereas the preferred path through the REST API and the OpenAI package is messages, which applies prompt formatting.

Therefore, we want to support messages so that all 3 ways of using our inference API are consistent.

More context here: https://www.notion.so/together-docs/Prompt-template-discrepancy-proposal-a557d4fb7f5d49d59a9b79480e0926b9
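
The discrepancy is easiest to see with a sketch of what a chat template does. The function below is illustrative only (not the library's actual formatter), using the widely documented Llama-2 [INST] chat layout:

```python
def format_llama2_chat(messages: list[dict[str, str]]) -> str:
    """Illustrative Llama-2-style formatting: wrap each user turn in
    [INST] ... [/INST] and append assistant turns verbatim."""
    prompt = ""
    for msg in messages:
        if msg["role"] == "user":
            prompt += f"[INST] {msg['content']} [/INST]"
        elif msg["role"] == "assistant":
            prompt += f" {msg['content']} "
    return prompt


# With messages, the template is applied for the caller:
formatted = format_llama2_chat([{"role": "user", "content": "hello there"}])
print(formatted)  # [INST] hello there [/INST]
```

With a raw prompt, the caller would have to build (and keep in sync) this model-specific string themselves, which is exactly what trips people up.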

@orangetin orangetin changed the title [WIP] v0.3 Updates [WIP] v1.0.0 Updates Feb 22, 2024
@clam004 (Contributor) commented Mar 14, 2024

Here is my async demo:

import asyncio
import os
import time

from together import AsyncTogether, Together

TOGETHER_API_KEY = os.getenv('TOGETHER_API_KEY')

def sync_chat_completion(messages, max_tokens):
    client = Together(api_key=TOGETHER_API_KEY)
    
    start_time = time.time()
    
    for message in messages:
        response = client.chat.completions.create(
            model="togethercomputer/llama-2-7b-chat", 
            max_tokens=max_tokens, 
            messages=[{"role": "user", "content": message}]
        )
        print(response.choices[0].message.content)
    
    end_time = time.time()
    print("Synchronous total execution time:", end_time - start_time, "seconds")

async def async_chat_completion(messages, max_tokens):
    async_client = AsyncTogether(api_key=TOGETHER_API_KEY)
    
    start_time = time.time()
    
    tasks = [async_client.chat.completions.create(
                model="togethercomputer/llama-2-7b-chat", 
                max_tokens=max_tokens, 
                messages=[{"role": "user", "content": message}]
             ) for message in messages]
             
    responses = await asyncio.gather(*tasks)
    
    for response in responses:
        print(response.choices[0].message.content)
    
    end_time = time.time()
    print("Asynchronous total execution time:", end_time - start_time, "seconds")

In a Jupyter notebook (where an event loop is already running):

messages = ["hi there what is the meaning of life?", "What country is Paris in?"]
sync_chat_completion(messages, 32)
await async_chat_completion(messages, 32)

Otherwise:

messages = ["hi there what is the meaning of life?", "What country is Paris in?"]
sync_chat_completion(messages, 32)
asyncio.run(async_chat_completion(messages, 32))

Expected output:

  The meaning of life is a question that has puzzled philosophers, theologians, and scientists for centuries. There are many different perspectives
  Paris is located in France. It is the capital and largest city of France, situated in the northern central part of the country.
Synchronous total execution time: 0.7738921642303467 seconds
  The meaning of life is a question that has puzzled philosophers, theologians, and scientists for centuries. There are many different perspectives
  Paris is located in France. It is the capital and largest city of France, situated in the northern central part of the country.
Asynchronous total execution time: 0.4429478645324707 seconds
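
The speedup in the output above comes from the requests overlapping instead of running back to back. The effect is reproducible without the API by substituting asyncio.sleep for the network call (all names below are illustrative, not part of the library):

```python
import asyncio
import time


async def fake_api_call(delay: float) -> str:
    # Stand-in for a network request that takes `delay` seconds.
    await asyncio.sleep(delay)
    return f"done after {delay}s"


async def run_both() -> tuple[list[str], float]:
    start = time.perf_counter()
    # Two 0.2s "requests" overlap under gather, so the total is
    # roughly 0.2s rather than the 0.4s a sequential loop would take.
    results = await asyncio.gather(fake_api_call(0.2), fake_api_call(0.2))
    return results, time.perf_counter() - start


results, elapsed = asyncio.run(run_both())
print(results, f"{elapsed:.2f}s")
```

The same reasoning explains why the async demo's wall-clock time is close to the single slowest request, not the sum of all of them.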

@orangetin orangetin marked this pull request as ready for review March 19, 2024 17:26
@orangetin orangetin requested a review from Nutlope March 19, 2024 17:26
@orangetin orangetin changed the title [WIP] v1.0.0 Updates [WIP] v0.3.0 Updates Mar 23, 2024
@Nutlope (Contributor) left a comment:

Looks great Abhy, amazing work! Feel free to merge, then I can make a new PR to update the README (and update our actual docs), then we can open source this repo + announce.

@orangetin orangetin changed the title [WIP] v0.3.0 Updates [WIP] v1.0.0 Updates Apr 1, 2024
@orangetin orangetin merged commit fc781ee into main Apr 1, 2024
@orangetin orangetin deleted the orangetin/eng-900 branch April 5, 2024 19:18