gokaygokay (gokay aydogan)

posted an update 11 days ago


Reflection Llama 3.1 70B (Correct Weights) on ZeroGPU thanks to llama.cpp and unsloth (for quantization)

ZeroGPU space
- gokaygokay/Reflection-70B-llamacpp

- Working Model
mattshumer/ref_70_e3

- Quantized Models
unsloth/Reflection-Llama-3.1-70B-GGUF

posted an update 13 days ago


FLUX Prompt Generator Updates

- gokaygokay/FLUX-Prompt-Generator

- There are now hundreds of new selections across diverse categories, each offering a lot of choices:

Architecture, Art, Artist, Brands, Character, Cinematic, Fashion, Feelings, Geography, Human, Interaction, Keywords, Objects, People, Photography, Plots, Poses, Scene, Science, Stuff, Time, Typography, Vehicle, Video Game

- In addition to Hugging Face, I've integrated new LLM providers: Groq, OpenAI, and Claude.

- Upgraded Vision Language Models (VLMs): We now feature Qwen2-VL, JoyCaption and Florence-2-large.

- New specialized system prompts for various styles and themes, including Happy, Simple, Poster, Only Objects, No Figure, Landscape, Fantasy.

posted an update about 1 month ago


Stitching / Blending / Sharpening

(I've created three Spaces that might be useful for some people.)

Stitching - gokaygokay/Stitching
Blending - gokaygokay/Blending
Sharpening - gokaygokay/Sharpening

posted an update about 1 month ago


I've built a space for creating prompts for FLUX

gokaygokay/FLUX-Prompt-Generator

You can create long prompts from images or simple words. Enhance your short prompts with the prompt enhancer. You can configure various settings such as artform, photo type, character details, scene details, style, and artist to create tailored prompts.

You can also combine all of them with custom prompts using LLMs (Mixtral, Mistral, Llama 3, and Mistral-Nemo).

The UI is a bit complex, but it includes almost everything you need. Choosing the random option is the most fun!

I've also created some other Spaces for using FLUX models with captioners and enhancers.
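To make the idea of configurable settings concrete, here is a minimal sketch of how such options could be assembled into one prompt string. It is not the Space's actual code: the `PromptSettings` class, the `build_prompt` function, and the ordering of the parts are all assumptions made for illustration. Only the setting names (artform, photo type, character details, scene details, style, artist) come from the post.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PromptSettings:
    # Hypothetical container; the field names mirror the settings named in the post.
    subject: str
    artform: Optional[str] = None
    photo_type: Optional[str] = None
    character_details: Optional[str] = None
    scene_details: Optional[str] = None
    style: Optional[str] = None
    artist: Optional[str] = None


def build_prompt(settings: PromptSettings, custom_prompt: str = "") -> str:
    """Join the settings that are set into a comma-separated prompt.

    The ordering below is an assumption, not the Space's real template.
    """
    parts = []
    if settings.artform:
        parts.append(settings.artform)
    if settings.photo_type:
        parts.append(settings.photo_type)
    parts.append(settings.subject)
    if settings.character_details:
        parts.append(settings.character_details)
    if settings.scene_details:
        parts.append(settings.scene_details)
    if settings.style:
        parts.append(f"{settings.style} style")
    if settings.artist:
        parts.append(f"by {settings.artist}")
    # A free-form custom prompt is appended last, if one was given.
    if custom_prompt.strip():
        parts.append(custom_prompt.strip())
    return ", ".join(parts)


if __name__ == "__main__":
    settings = PromptSettings(
        subject="a lighthouse",
        artform="photography",
        style="cinematic",
    )
    print(build_prompt(settings, custom_prompt="golden hour"))
```

In a real pipeline, the resulting string would typically be passed to an LLM for rewriting into a longer prompt; that step is left out here because it depends on the provider.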

- gokaygokay/FLUX.1-dev-with-Captioner


posted an update 2 months ago


InSPyReNet Background Removal

I've built a space for fast background removal.

- gokaygokay/Inspyrenet-Rembg

- https://github.com/plemeri/InSPyReNet

posted an update 2 months ago


I've made a creative version of Tile Upscaler

- gokaygokay/TileUpscalerV2

- https://github.com/gokayfem/Tile-Upscaler

- New tiling strategy
- Now it's closer to Clarity Upscaler
- It has more parameters to play with, which also gives it more room to fail
- You should try different resolutions, strength values, and ControlNet strength values
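For readers unfamiliar with tiled upscaling, the sketch below shows one generic way to split an image into overlapping tiles. It is not the tiling strategy used in TileUpscalerV2, which the post does not describe; the `tile_boxes` function, the default tile size of 1024, and the overlap of 128 pixels are all assumptions chosen for illustration. It does show why the chosen resolution matters: larger images produce more tiles to process.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom)


def _starts(length: int, tile: int, overlap: int) -> List[int]:
    """Start offsets along one axis so that tiles cover the full length."""
    if tile >= length:
        return [0]
    stride = tile - overlap
    starts = list(range(0, length - tile, stride))
    # The last tile is aligned to the far edge so no pixels are missed.
    starts.append(length - tile)
    return starts


def tile_boxes(width: int, height: int, tile: int = 1024, overlap: int = 128) -> List[Box]:
    """Return overlapping tile boxes covering a width x height image.

    Hypothetical helper; tile and overlap defaults are illustrative only.
    """
    if not 0 <= overlap < tile:
        raise ValueError("overlap must be non-negative and smaller than tile")
    xs = _starts(width, tile, overlap)
    ys = _starts(height, tile, overlap)
    return [
        (x, y, min(x + tile, width), min(y + tile, height))
        for y in ys
        for x in xs
    ]


if __name__ == "__main__":
    for box in tile_boxes(2048, 1024):
        print(box)
```

Each tile would then be processed on its own and blended back along the overlapping regions; the blending step is omitted here.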

Original Tile Upscaler
- gokaygokay/Tile-Upscaler

posted an update 2 months ago


Kolors with VLM support

I've built a space for using the Kolors image generation model with captioner models and prompt enhancers.

- Space with VLM and Prompt Enhancer
gokaygokay/KolorsPlusPlus

- Original Space for model
gokaygokay/Kolors

- Captioner VLMs
- gokaygokay/sd3-long-captioner-v2

- microsoft/Florence-2-base

- Prompt Enhancers
- gokaygokay/Lamini-Prompt-Enchance-Long

- gokaygokay/Lamini-Prompt-Enchance

posted an update 2 months ago


Kolors: Effective Training of Diffusion Model for Photorealistic Text-to-Image Synthesis

Kolors is a large-scale text-to-image generation model based on latent diffusion, developed by the Kuaishou Kolors team.

Hugging Face Spaces
- gokaygokay/Kolors

Model Page
- Kwai-Kolors/Kolors

posted an update 3 months ago


I've created a space for chatting with Gemma 2 using llama.cpp

- 🎛️ Choose between 27B IT and 9B IT models
- 🚀 Fast inference using llama.cpp

- gokaygokay/Gemma-2-llamacpp


posted an update 3 months ago


I've created a Stable Diffusion 3 (SD3) image generation space for convenience. Now you can:

1. Generate SD3 prompts from images
2. Enhance your text prompts (turn 1-2 words into full SD3 prompts)

https://huggingface.co/spaces/gokaygokay/SD3-with-VLM-and-Prompt-Enhancer

These features are based on my custom models:

- VLM captioner for prompt generation:
- gokaygokay/sd3-long-captioner

- Prompt Enhancers for SD3 Models:
- gokaygokay/Lamini-Prompt-Enchance-Long
- gokaygokay/Lamini-Prompt-Enchance

You can now simplify your SD3 workflow with these tools!

replied to their post 3 months ago

I think it looks amazing for a 0.22B model. I've seen some very recent 3B and even 7B models that are worse than this, and it's highly fine-tunable. I fine-tuned it with only 3,500 training samples in under 15 minutes.

replied to their post 3 months ago

https://huggingface.co/spaces/gokaygokay/Florence-2-SD3-Captioner

replied to their post 3 months ago

They've already fine-tuned the base model, and the fine-tuned model looks better at segmentation and object detection. But its captions are shorter and less detailed. Maybe that's a good thing for hallucinations, but sometimes the fine-tuned model gives almost no details. To answer your question, though, it does look like a fine-tunable model.

replied to their post 3 months ago

https://colab.research.google.com/github/google/generative-ai-docs/blob/main/site/en/gemma/docs/paligemma/fine-tuning-paligemma.ipynb

I've used this fine-tuning notebook.

posted an update 3 months ago


I've fine-tuned three types of PaliGemma image captioner models for generating prompts for text-to-image models. They generate captions similar to the prompts we give to image generation models. I used the google/docci and google/imageinwords datasets for fine-tuning.

This one gives you longer captions.

gokaygokay/SD3-Long-Captioner

This one gives you medium-length captions.

https://huggingface.co/spaces/gokaygokay/SD3-Long-Captioner-V2

And this one gives you shorter captions.

https://huggingface.co/spaces/gokaygokay/SDXL-Captioner

