Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> These fine-tuned models are intended for research use only and are released under a noncommercial CC BY-NC-SA 4.0 license, in-line with Stanford’s Alpaca license.

This is a no-commercial-use-allowed license; it is neither considered free software nor open source, the definitions of which disallow restrictions on what you can use the work for.



The two sentences prior are important:

> We are also releasing a set of research models that are instruction fine-tuned. Initially, these fine-tuned models will use a combination of five recent open-source datasets for conversational agents: Alpaca, GPT4All, Dolly, ShareGPT, and HH. These fine-tuned models are intended for research use only and are released under a noncommercial CC BY-NC-SA 4.0 license, in-line with Stanford’s Alpaca license.

The snippet you quoted is not talking about the main model in the announcement. It's talking about fine-tuned models based on other models. Stability has to respect the license of the originals. They cannot change it.

The main model is described higher up in the post and is permissible for commercial:

> Developers can freely inspect, use, and adapt our StableLM base models for commercial or research purposes, subject to the terms of the CC BY-SA-4.0 license


It also appears that CC BY-SA-4.0 is GPL-compatible. Not a lawyer and this is not legal advice, but it certainly seems like one could operate their own StableLM server/service and allow proprietary code to use it over a network interface, much like one could use a GPL-licensed database system.

https://creativecommons.org/faq/#can-i-apply-a-creative-comm...


Interesting. A non-LLM hallucinating. And to think we used to believe that was only a property of LLMs.


Huh? Everything they said is true, isn’t it?


StableLM, the model that's theirs is CC-BY-SA 4.0.


The instruction tuned versions are under a non-commercial license though because some of the data they used is


Not their fault, the instruct-tuned models depend on non-open data.... Which should be open however. Scraping chatGpt is legal


Agreed. Scraping ChatGPT is against OpenAI terms of use and OpenAI is entitled to terminate your access immediately upon notice, but since ChatGPT output is not copyrighted (and copyrightable), output you acquired before termination should be freely redistributable. I am not sure why Stanford Alpaca authors think otherwise but they are wrong.

https://openai.com/policies/terms-of-use

Thank you for using OpenAI! These Terms of Use apply when you use the services of OpenAI, L.L.C. (snip) By using our Services, you agree to these Terms. (snip) You may not (iii) use output from the Services to develop models that compete with OpenAI. (snip) We may terminate these Terms immediately upon notice to you if you materially breach Sections 2 (Usage Requirements).




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: