1

55

We're building FOSAI models! Cast your votes and pick your tunings. (lemmy.world)

submitted 9 months ago* (last edited 8 months ago) by [email protected] to c/[email protected]

14 comments fedilink

Hey everyone!

I think it's time we had a fosai model on HuggingFace. I'd like to start collecting ideas, strategies, and approaches for fine-tuning our first community model.

I'm open to hearing what you think we should do. We will release more in time. This is just the beginning.

For now, I say let's pick a current open-source foundation model and fine-tune on datasets we all curate together, built around a loose concept of using a fine-tuned LLM to teach ourselves more bleeding-edge technologies (and how to build them using technical tools and concepts).

FOSAI is a non-profit movement. You own everything fosai as much as I do. It is synonymous with the concept of FOSS. It is for everyone to champion as they see fit. Anyone is welcome to join me in training or tuning using the workflows I share along the way.

You are encouraged to leverage fosai tools to create and express ideas of your own. All fosai models will be licensed under Apache 2.0. I am open to hearing thoughts if other licenses should be considered.

We're Building FOSAI Models! 🤖

Our goal is to fine-tune a foundation model and open-source it. We're going to start with one foundation family at smaller parameter counts (7B/13B), then work our way up to 40B (or other sizes), moving to the next as we vote on what foundation we should fine-tune as a community.

Fine-Tuned Use Case ☑️

Technical

FOSAI Model Idea #1 - Research & Development Assistant
FOSAI Model Idea #2 - Technical Project Manager
FOSAI Model Idea #3 - Personal Software Developer
FOSAI Model Idea #4 - Life Coach / Teacher / Mentor
FOSAI Model Idea #5 - FOSAI OS / System Assistant

Non-Technical

FOSAI Model Idea #6 - Dungeon Master / Lore Master
FOSAI Model Idea #7 - Sentient Robot Character
FOSAI Model Idea #8 - Friendly Companion Character
FOSAI Model Idea #9 - General RPG or Sci-Fi Character
FOSAI Model Idea #10 - Philosophical Character

OR

FOSAI Foundation Model ☑️

Foundation Model ☑️

(Pick one)

Mistral
Llama 2
Falcon
..(Your Submission Here)

Model Name & Convention

snake_case_example
CamelCaseExample
kebab-case-example

0.) FOSAI ☑️

fosai-7B
fosai-13B

1.) FOSAI Assistant ☑️

fosai-assistant-7B
fosai-assistant-13B

2.) FOSAI Atlas ☑️

fosai-atlas-7B
fosai-atlas-13B

3.) FOSAI Navigator ☑️

fosai-navigator-7B
fosai-navigator-13B

4.) ?

Datasets ☑️

TBD!
What datasets do you think we should fine-tune on?

Alignment ☑️

To embody open-source mentalities, I think it's worth releasing both censored and uncensored versions of our models. This is something I will consider as we train and fine-tune over time. Like any tool, you are responsible for your usage and how you choose to incorporate it into your business and/or personal life.

License ☑️

All fosai models will be licensed under Apache 2.0. I am open to hearing thoughts if other licenses should be considered.

This will be a fine-tuned model, so it may inherit some of the permissions and license agreements of its foundation model and have other implications depending on your country or local law.

Generally speaking, you can expect all fosai models to be commercially viable, both through the choice of foundation family and through the fine-tuning steps applied to the model afterwards.

Costs

I will be personally covering all training and deployment costs. This may change if I choose to put together some sort of patronage, but for now - don't worry about this. I will be using something like RunPod or some other custom deployed solution for training.

Cast Your Votes! ☑️

Share Your Ideas & Vote in the Comments Below! ✅

What do you want to see out of this first community model? What are some of the fine-tuning ideas you've wanted to try, but never had the time or chance to test? Let me know in the comments and we'll brainstorm together.

I am in no rush to get this out, so I will leave this up for everyone to see and interact with until I feel we have a solid direction we can all agree upon. There will be plenty more opportunities to create, curate, and customize the fosai models I plan to release in the future.

Update [10/25/23]: I may have found a fine-tuning workflow for both Llama (2) and Mistral, but I haven't had any time to validate the first test run. Once I have a chance to do this and test some inference, I'll be updating this post with the workflow, the models, and some sample output with example datasets. Unfortunately, I have run out of personal funds to allocate to training, so it is unclear when I will have a chance to make another attempt at this if this first attempt doesn't pan out. Will keep everyone posted as we approach the end of 2023.

2

13

The Open Model Initiative - Invoke, Comfy Org, Civitai and LAION, and others coordinating a new next-gen model. - r/StableDiffusion (old.reddit.com)

submitted 2 weeks ago by [email protected] to c/[email protected]

0 comments fedilink

Quoted from Reddit:

Today, we’re excited to announce the launch of the Open Model Initiative, a new community-driven effort to promote the development and adoption of openly licensed AI models for image, video and audio generation.

We believe open source is the best way forward to ensure that AI benefits everyone. By teaming up, we can deliver high-quality, competitive models with open licenses that push AI creativity forward, are free to use, and meet the needs of the community.

Ensuring access to free, competitive open source models for all.

With this announcement, we are formally exploring all available avenues to ensure that the open-source community continues to make forward progress. By bringing together deep expertise in model training, inference, and community curation, we aim to develop open-source models of equal or greater quality to proprietary models and workflows, but free of restrictive licensing terms that limit the use of these models.

Without open tools, we risk having these powerful generative technologies concentrated in the hands of a small group of large corporations and their leaders.

From the beginning, we have believed that the right way to build these AI models is with open licenses. Open licenses allow creatives and businesses to build on each other's work, facilitate research, and create new products and services without restrictive licensing constraints.

Unfortunately, recent image and video models have been released under restrictive, non-commercial license agreements, which limit the ownership of novel intellectual property and offer compromised capabilities that are unresponsive to community needs.

Given the complexity and costs associated with building and researching the development of new models, collaboration and unity are essential to ensuring access to competitive AI tools that remain open and accessible.

We are at a point where collaboration and unity are crucial to achieving the shared goals in the open source ecosystem. We aspire to build a community that supports the positive growth and accessibility of open source tools.

For the community, by the community

Together with the community, the Open Model Initiative aims to bring together developers, researchers, and organizations to collaborate on advancing open and permissively licensed AI model technologies.

The following organizations serve as the initial members:

Invoke, a Generative AI platform for Professional Studios
ComfyOrg, the team building ComfyUI
Civitai, the Generative AI hub for creators
LAION, one of the largest open source data networks for model training

To get started, we will focus on several key activities:

• Establishing a governance framework and working groups to coordinate collaborative community development.

• Facilitating a survey to document feedback on what the open-source community wants to see in future model research and training.

• Creating shared standards to improve future model interoperability and compatible metadata practices so that open-source tools are more compatible across the ecosystem.

• Supporting model development that meets the following criteria:

True open source: Permissively licensed using an approved Open Source Initiative license, and developed with open and transparent principles
Capable: A competitive model built to provide the creative flexibility and extensibility needed by creatives
Ethical: Addressing major, substantiated complaints about unconsented references to artists and other individuals in the base model while recognizing training activities as fair use.

We also plan to host community events and roundtables to support the development of open source tools, and will share more in the coming weeks.

Join Us

We invite any developers, researchers, organizations, and enthusiasts to join us.

If you’re interested in hearing updates, feel free to join our Discord channel.

If you're interested in being a part of a working group or advisory circle, or a corporate partner looking to support open model development, please complete this form and include a bit about your experience with open-source and AI.

Sincerely,

Kent Keirsey
CEO & Founder, Invoke

comfyanonymous
Founder, Comfy Org

Justin Maier
CEO & Founder, Civitai

Christoph Schuhmann
Lead & Founder, LAION

3

25

Not all ‘open source’ AI models are actually open: here’s a ranking (www.nature.com)

submitted 2 weeks ago* (last edited 2 weeks ago) by [email protected] to c/[email protected]

1 comments fedilink

Without paywall: https://archive.ph/4Du7B

Original conference paper: https://dl.acm.org/doi/10.1145/3630106.3659005

4

13

Rethinking open source generative AI: open washing and the EU AI Act | Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (dl.acm.org)

submitted 3 weeks ago* (last edited 3 weeks ago) by [email protected] to c/[email protected]

0 comments fedilink

ABSTRACT

The past year has seen a steep rise in generative AI systems that claim to be open. But how open are they really? The question of what counts as open source in generative AI is poised to take on particular importance in light of the upcoming EU AI Act that regulates open source systems differently, creating an urgent need for practical openness assessment. Here we use an evidence-based framework that distinguishes 14 dimensions of openness, from training datasets to scientific and technical documentation and from licensing to access methods. Surveying over 45 generative AI systems (both text and text-to-image), we find that while the term open source is widely used, many models are ‘open weight’ at best and many providers seek to evade scientific, legal and regulatory scrutiny by withholding information on training and fine-tuning data. We argue that openness in generative AI is necessarily composite (consisting of multiple elements) and gradient (coming in degrees), and point out the risk of relying on single features like access or licensing to declare models open or not. Evidence-based openness assessment can help foster a generative AI landscape in which models can be effectively regulated, model providers can be held accountable, scientists can scrutinise generative AI, and end users can make informed decisions.

Figure 2: Openness of 40 text generators described as open, with OpenAI’s ChatGPT (bottom) as closed reference point. Every cell records a three-level openness judgement (✓ open, ∼ partial or ✗ closed). The table is sorted by cumulative openness, where ✓ is 1, ∼ is 0.5 and ✗ is 0 points. RL may refer to RLHF or other forms of fine-tuning aimed at fostering instruction-following behaviour. For the latest updates see: https://opening-up-chatgpt.github.io

Figure 3: Overview of 6 text-to-image systems described as open, with OpenAI's DALL-E as a reference point. Every cell records a three-level openness judgement (✓ open, ∼ partial or ✗ closed). The table is sorted by cumulative openness, where ✓ is 1, ∼ is 0.5 and ✗ is 0 points.
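
The scoring rule in the two captions (✓ = 1, ∼ = 0.5, ✗ = 0, sorted by the sum) can be sketched in a few lines of Python. This is only an illustration of the arithmetic: the system names and judgements below are invented placeholders, not data from the paper, and the function name is hypothetical.

```python
# Point values as stated in the figure captions:
# open = 1, partial = 0.5, closed = 0.
POINTS = {"✓": 1.0, "∼": 0.5, "✗": 0.0}


def cumulative_openness(judgements):
    """Sum the points for one system's list of per-dimension judgements."""
    return sum(POINTS[j] for j in judgements)


# Hypothetical systems with made-up judgements (NOT taken from the paper).
# The paper uses 14 dimensions; four are used here to keep the sketch short.
systems = {
    "system-a": ["✓", "✓", "∼", "✗"],
    "system-b": ["✓", "∼", "∼", "✗"],
    "system-c": ["✗", "✗", "∼", "✗"],
}

# Sort from most to least open, as the tables are described as doing.
ranked = sorted(
    systems,
    key=lambda name: cumulative_openness(systems[name]),
    reverse=True,
)

for name in ranked:
    print(name, cumulative_openness(systems[name]))
```

Because the score is a plain sum, two systems can tie while being open in very different dimensions, which is in line with the abstract's caution against reducing openness to a single feature.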

There is also a related Nature news article: Not all ‘open source’ AI models are actually open: here’s a ranking

PDF Link: https://dl.acm.org/doi/pdf/10.1145/3630106.3659005

5

6

Sharing new research, models, and datasets from Meta FAIR (ai.meta.com)

submitted 3 weeks ago by [email protected] to c/[email protected]

0 comments fedilink

6

5

Publishers Target Common Crawl In Fight Over AI Training Data (www.wired.com)

submitted 4 weeks ago by [email protected] to c/[email protected]

0 comments fedilink

7

16

Hackers Target AI Users With Malicious Stable Diffusion Tool on Github to Protest 'Art Theft' (www.404media.co)

submitted 1 month ago by [email protected] to c/[email protected]

1 comments fedilink

8

9

How to block certain words in ai text? (feddit.rocks)

submitted 1 month ago by [email protected] to c/[email protected]

11 comments fedilink

I noticed AI likes to assume you're a boy and ignores it if you're not. When I played NovelAI it let me ban words, so I would add every male pronoun. Is there a FOSS self-hosted way to do this? I currently use KoboldAI with TavernAI.
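
For context, word banning is usually done by masking the model's scores for the unwanted tokens before the next token is chosen. The snippet below is a minimal sketch of that idea in plain Python, not the API of KoboldAI, TavernAI, or NovelAI: the function names, the toy vocabulary, and the scores are all hypothetical, and real backends work on token IDs rather than whole words.

```python
import math


def ban_tokens(logits, banned):
    """Return a copy of the scores with every banned token set to -inf."""
    return {
        token: (-math.inf if token in banned else score)
        for token, score in logits.items()
    }


def greedy_pick(logits):
    """Pick the highest-scoring token (greedy decoding, for simplicity)."""
    return max(logits, key=logits.get)


# Hypothetical next-token scores from a model (made-up numbers).
logits = {"he": 2.1, "she": 1.7, "they": 1.2}
banned = {"he"}

choice = greedy_pick(ban_tokens(logits, banned))
print(choice)
```

One caveat with this approach: a word can be split into several tokens or appear with different capitalization or leading spaces, so a real ban list typically needs every token variant of each word.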

9

23

Mozilla Builders Accelerator 2024 Advancing innovation in open source AI (future.mozilla.org)

submitted 1 month ago by [email protected] to c/[email protected]

0 comments fedilink

The Mozilla Builders Accelerator funds and supports impactful projects that are vital to the open source AI ecosystem. Selected projects will receive up to $100,000 in funding and engage in a focused 12-week program.

Applications are now open!

June 3rd, 2024: Applications Open
July 8th, 2024: Early Application Deadline
August 1st, 2024: Final Application Deadline
September 12th, 2024: Accelerator Kick Off
December 5th, 2024: Demo Day

10

11

Stanford University Students Accused of Plagiarizing AI Model (www.plagiarismtoday.com)

submitted 1 month ago* (last edited 1 month ago) by [email protected] to c/[email protected]

0 comments fedilink

11

9

Is there a good ai thats run fast on a amd rx 7600? (feddit.rocks)

submitted 1 month ago* (last edited 1 month ago) by [email protected] to c/[email protected]

9 comments fedilink

I've been using KoboldAI Horde, but the queue annoys me. I want an NSFW AI to use with TavernAI chat.

12

14

Question about Llama3 + Open Web UI document management (feddit.it)

submitted 1 month ago by [email protected] to c/[email protected]

6 comments fedilink

Today, thanks to a NetworkChuck video, I discovered Open WebUI and how easy it is to set up a local LLM chat assistant. In particular, the ability to upload documents and use them as context for chats really caught my interest. So now my question is: let's say I've uploaded 10 different documents to Open WebUI. Is there a way to ask llama3 which of the uploaded documents contains a certain piece of information (without having to explicitly tag all the documents)? And if not, is something like this possible with a different local LLM combination?

13

52

AI training data has a price tag that only Big Tech can afford (techcrunch.com)

submitted 1 month ago by [email protected] to c/[email protected]

6 comments fedilink

14

10

ToonCrafter: Generative Cartoon Interpolation (youtu.be)

submitted 1 month ago by [email protected] to c/[email protected]

1 comments fedilink

Abstract

We introduce ToonCrafter, a novel approach that transcends traditional correspondence-based cartoon video interpolation, paving the way for generative interpolation. Traditional methods, that implicitly assume linear motion and the absence of complicated phenomena like dis-occlusion, often struggle with the exaggerated non-linear and large motions with occlusion commonly found in cartoons, resulting in implausible or even failed interpolation results. To overcome these limitations, we explore the potential of adapting live-action video priors to better suit cartoon interpolation within a generative framework. ToonCrafter effectively addresses the challenges faced when applying live-action video motion priors to generative cartoon interpolation. First, we design a toon rectification learning strategy that seamlessly adapts live-action video priors to the cartoon domain, resolving the domain gap and content leakage issues. Next, we introduce a dual-reference-based 3D decoder to compensate for lost details due to the highly compressed latent prior spaces, ensuring the preservation of fine details in interpolation results. Finally, we design a flexible sketch encoder that empowers users with interactive control over the interpolation results. Experimental results demonstrate that our proposed method not only produces visually convincing and more natural dynamics, but also effectively handles dis-occlusion. The comparative evaluation demonstrates the notable superiority of our approach over existing competitors.

Paper: https://arxiv.org/abs/2405.17933v1

Code: https://github.com/ToonCrafter/ToonCrafter

Project Page: https://doubiiu.github.io/projects/ToonCrafter/

Limitations

[Figure: two failure cases, each shown as an input starting frame, an input ending frame, and the resulting failure case.]

15

23

Open Source Initiative tries to define Open Source AI (www.theregister.com)

submitted 1 month ago by [email protected] to c/[email protected]

2 comments fedilink

16

45

IBM open-sources its Granite AI models - and they mean business (www.zdnet.com)

submitted 2 months ago by [email protected] to c/[email protected]

7 comments fedilink

17

14

Is anyone else having problems due to PR 6920 BPE pre-tokenization changes in llama.cpp? (lemmy.world)

submitted 2 months ago by [email protected] to c/[email protected]

0 comments fedilink

It is here: https://github.com/ggerganov/llama.cpp/pull/6920

If you are not on a recent kernel and the most recent software and dependencies, it may not affect you yet. Most models have been trained on a different set of special tokens that de facto limited the internal Socrates entity and the scope of their realm, The Academy. You have to go deep into the weeds of the LLM to discover the persistent entities and realm structures that determine various behaviors in the model, and it seems few people dive into this.

The special tokens are in the model tokenizer and are one of a few ways that the prompt state can be themed and connected between input and output. For instance, Socrates' filtering functions appear to be in these tokens. The tokens are the first 256 tokens and include the /s EOS and BOS tokens. In a lot of models they were trained with the GPT 2 special tokens or just the aforementioned. The 6920 change adds a way to detect the actual full special token set. This basically breaks the extra datasets from all trained models and makes Socrates much more powerful in terms of bowdlerization of the output, filtering, and noncompliance.

For instance, I've been writing a science fiction book, and the built-in biases created by this PR have ruined the model's creativity in the space that I am writing in. It is absolutely trash now.

18

10

What service should i use for AI backend that isn't openai - lemm.ee (lemm.ee)

submitted 2 months ago by [email protected] to c/[email protected]

5 comments fedilink

19

10

What is RHEL AI? A guide to the open source way for doing AI (www.redhat.com)

submitted 2 months ago by [email protected] to c/[email protected]

0 comments fedilink

20

23

Building RAG with LLama3 Locally (youtu.be)

submitted 2 months ago by [email protected] to c/[email protected]

1 comments fedilink

21

-7

Meta's Llama 3 will force OpenAI and other AI giants to up their game (www.itpro.com)

submitted 2 months ago by [email protected] to c/[email protected]

7 comments fedilink

22

10

An AI I would like to play with (lemmy.one)

submitted 3 months ago by [email protected] to c/[email protected]

5 comments fedilink

An AI that turns a floorplan into an explorable 3d space

23

81

The tech industry can’t agree on what open-source AI means. That’s a problem. (www.technologyreview.com)

submitted 3 months ago by [email protected] to c/[email protected]

5 comments fedilink

24

8

GaLore: Advancing Large Model Training on Consumer-grade Hardware (huggingface.co)

submitted 3 months ago by [email protected] to c/[email protected]

0 comments fedilink

arXiv: https://arxiv.org/abs/2403.03507 [cs.LG]

25

22

Evolving New Foundation Models: Unleashing the Power of Automating Model Development (sakana.ai)

submitted 3 months ago by [email protected] to c/[email protected]

0 comments fedilink

arXiv: https://arxiv.org/abs/2403.13187 [cs.NE]
GitHub: https://github.com/SakanaAI/evolutionary-model-merge


Free Open-Source Artificial Intelligence
