I used to feel the same way until I found some very interesting performance results from 3B and 7B parameter models.
Granted, it wasn’t anything I’d deploy to production - but prototyping quick ideas with the smaller models is great before having to rent a GPU and spend time working with the bigger ones.
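For anyone curious what that kind of quick prototyping looks like, here’s a minimal sketch using the Hugging Face transformers pipeline - the checkpoint name is just an example, swap in whichever small model you want to try:

```python
# Minimal local prototyping sketch.
# Assumes the transformers library is installed; the model name below is
# only an example checkpoint, not a specific recommendation.
from transformers import pipeline

pipe = pipeline("text-generation", model="mistralai/Mistral-7B-Instruct-v0.2")

out = pipe(
    "Explain in one sentence why small models are handy for prototyping:",
    max_new_tokens=80,
)
print(out[0]["generated_text"])
```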
Give a few models a try! You might be pleasantly surprised. There’s plenty to choose from too. You will get wildly different results depending on your use case and prompting approach.
Let us know if you end up finding one you like! I think it’s only a matter of time before we’re casually running 40B+ parameter models at home.
I am actively testing this out. It’s hard to say at the moment. There’s a lot to figure out when deploying a model into a live environment, but I think there’s real value in using them for technical tasks - especially as models mature and improve over time.
At the moment, though, performance is closer to GPT-3.5 than GPT-4, but I wouldn’t be surprised if that’s no longer the case within the next year or so.
Assuming everything from the papers translates into current platforms, yes! A rather significant one at that. Time will tell as people begin tinkering with this new approach in the near future.
Thanks for reading! I’m glad you enjoy the content. I find this tech beyond fascinating.
Who knows, over time you might even begin to pick up on some of the nuance you describe.
We’re all learning this together!
Thanks for sharing this!
Good bot, I will do that next time.
Come hang out with us at !fosai@lemmy.world
I run this show solo at the moment, but do my best to keep everyone informed. I have much more content on the horizon. Would love to have you if we have what you’re looking for.
Mistral seems to be the popular choice. I think it’s the most open-source-friendly of the bunch. I will keep function calling in mind as I design some of our models! Thanks for bringing that up.
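For anyone unfamiliar, function calling just means the model emits a structured call (usually JSON) that your own code parses and dispatches. Here’s a rough sketch of the dispatch side - the get_weather tool and the JSON shape are made-up stand-ins, not any particular model’s format:

```python
import json

# Hypothetical tool registry -- get_weather is a stand-in for whatever
# real functions you'd expose to the model.
TOOLS = {
    "get_weather": lambda city: f"Sunny in {city}",
}

def dispatch(model_output: str) -> str:
    # Assumes the model was prompted to reply with JSON like:
    # {"name": "get_weather", "arguments": {"city": "Berlin"}}
    call = json.loads(model_output)
    fn = TOOLS[call["name"]]
    return fn(**call["arguments"])

print(dispatch('{"name": "get_weather", "arguments": {"city": "Berlin"}}'))
# -> Sunny in Berlin
```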