Skip to main content

Deck.blue brings a TweetDeck experience to Bluesky users

With over 3 million users and plans to open up more broadly in the months ahead, Bluesky is still establishing itself as an alternative to Twitter/X. However, that hasn’t stopped the developer community from embracing the project and building tools to meet the needs of those fleeing the now Elon Musk-owned social network, formerly known […] © 2024 TechCrunch. All rights reserved. For personal use only. from TechCrunch https://ift.tt/TBbEAPF

There’s now an open source alternative to ChatGPT, but good luck running it

The first open-source equivalent of OpenAI’s ChatGPT has arrived, but good luck running it on your laptop — or at all.

This week, Philip Wang, the developer responsible for reverse-engineering closed-sourced AI systems including Meta’s Make-A-Video, released PaLM + RLHF, a text-generating model that behaves similarly to ChatGPT. The system combines PaLM, a large language model from Google, and a technique called Reinforcement Learning with Human Feedback — RLHF, for short — to create a system that can accomplish pretty much any task that ChatGPT can, including drafting emails and suggesting computer code.

But PaLM + RLHF isn’t pretrained. That is to say, the system hasn’t been trained on the example data from the web necessary for it to actually work. Downloading PaLM + RLHF won’t magically install a ChatGPT-like experience — that would require compiling gigabytes of text from which the model can learn and finding hardware beefy enough to handle the training workload.

Like ChatGPT, PaLM + RLHF is essentially a statistical tool to predict words. When fed an enormous number of examples from training data — e.g. posts from Reddit, news articles and ebooks — PaLM + RLHF learns how likely words are to occur based on patterns like the semantic context of surrounding text.

ChatGPT and PaLM + RLHF share a special sauce in Reinforcement Learning with Human Feedback, a technique that aims to better align language models with what users wish them to accomplish. RLHF involves training a language model — in PaLM + RLHF’s case, PaLM — and fine-tuning it on a data set that includes prompts (e.g. “Explain machine learning to a six-year-old”) paired with what human volunteers expect the model to say (e.g. “Machine learning is a form of AI…”). The aforementioned prompts are then fed to the fine-tuned model, which generates several responses, and the volunteers rank all the responses from best to worst. Finally, the rankings are used to train a “reward model” that takes the original model’s responses and sorts them in order of preference, filtering for the top answers to a given prompt.

It’s an expensive process, collecting the training data. And training itself isn’t cheap. PaLM is 540 billion parameters in size, “parameters” referring to the parts of the language model learned from the training data. A 2020 study pegged the expenses for developing a text-generating model with only 1.5 billion parameters at as much as $1.6 million. And to train the open source model Bloom, which has 176 billion parameters, it took three months using 384 Nvidia A100 GPUs; a single A100 costs thousands of dollars.

Running a trained model of PaLM + RLHF’s size isn’t trivial, either. Bloom requires a dedicated PC with around eight A100 GPUs. Cloud alternatives are pricey, with back-of-the-envelope math finding the cost of running OpenAI’s text-generating GPT-3 — which has around 175 billion parameters — on a single Amazon Web Services to be around $87,000 per year.

Sebastian Raschka, an AI researcher, points out in a LinkedIn post about PaLM + RLHF that scaling up the necessary dev workflows could prove to be a challenge as well. “Even if someone provides you with 500 GPUs to train this model, you still need to have to deal with infrastructure and have a software framework that can handle that,” he said. “It’s obviously possible, but it’s a big effort at the moment (of course, we are developing frameworks to make that simpler, but it’s still not trivial, yet).”

That’s all to say that PaLM + RLHF isn’t going to replace ChatGPT today — unless a well-funded venture (or person) goes to the trouble of training and making it available publicly.

In better news, several other efforts to replicate ChatGPT are progressing at a fast clip, including one led by a research group called CarperAI. In partnership with the open AI research organization EleutherAI and startups Scale AI and Hugging Face, CarperAI plans to release the first ready-to-run, ChatGPT-like AI model trained with human feedback.

LAION, the nonprofit that supplied the initial data set used to train Stable Diffusion, is also spearheading a project to replicate ChatGPT using the newest machine learning techniques. Ambitiously, LAION aims to build an “assistant of the future” — one that not only writes emails and cover letters but “does meaningful work, uses APIs, dynamically researches information, and much more.” It’s in the early stages. But a GitHub page with resources for the project went live a few weeks ago.

There’s now an open source alternative to ChatGPT, but good luck running it by Kyle Wiggers originally published on TechCrunch



from TechCrunch https://ift.tt/DpMsZGI

Comments

Popular posts from this blog

Nimbus launches tiny EV prototype that’s like a motorbike with a roof

As shared e-scooter companies have infiltrated cities and e-bike sales have soared, micromobility has been offered up as a panacea to save us all from the ill humors and packed streets caused by gas-guzzling cars. However, one of the major roadblocks in front of well-intentioned city dwellers who’d love to trade in their cumbersome and environmentally unfriendly vehicles for an e-bike or scooter remains: What happens when it rains? Nimbus, a California-based electric vehicle startup, wants to solve that problem with a simple solution: Put a roof on it. The company recently came out of stealth with a prototype for its Nimbus One, a tiny, three-wheeled EV that “combines the convenience and cost of a motorbike with the safety and comfort of a car.” The Nimbus One. Image Credits: Nimbus The thin, pod-like vehicle is only about 2.75 feet wide and 7.5 feet long, which Nimbus says makes it three to five times smaller than a compact car — the better to park and navigate busy urban stree...

Pitch Deck Teardown: Encore’s $3M seed deck

For this week’s Pitch Deck Teardown, I’m (virtually) traveling to Sweden to take a look at the $3 million seed round raised by developer tool startup Encore . The company is creating what it calls a software development platform for the cloud. It reportedly raised from Crane Venture Partners with Acequia Capital ,  Essence Venture Capital  and  Third Kind Venture Capital joining the round. I wanted to take a look at this deck in more detail, in particular, because it tells a really elegant story in a market where it’s extraordinarily hard to differentiate yourself — both to your customers and to investors! Pitching a dev tool in a way that tells the story well enough to understand but without dropping deep into a rabbit hole is a particularly hard challenge, and that’s the needle Encore threads ever so efficiently in this 24-slide pitch deck. We’re looking for more unique pitch decks to tear down, so if you want to submit your own, here’s how you can do that ....

Multifamily housing has missed the solar boom. PearlX wants to fix that with $70M Series B

If you’re a renter and you want solar power, you’re usually out of luck. For most, the only option is a community solar program, where people subscribe to utility-scale projects, but they’re not available everywhere. And given that most renters only stay for a few years, which of them are going to pay tens of thousands of dollars for solar panels — and what landlord would let them? That’s where PearlX comes in. “Think of us as like the Sunrun for renters,” said co-founder and CEO Michael Huerta, referring to the company that rents solar installations to single-family homeowners. “PearlX is a rental electrification platform.” Earlier this year, the startup began installing solar panels and backup batteries at multifamily rentals in Texas as part of its “TexFlex” project. PearlX’s next step, which Huerta shared exclusively with TechCrunch, will be a California expansion called “Flexifornia.” The startup is also rolling out a virtual power plant, which will allow the company to tap the...