Feature #3607: Implementing AI into Hope project - AAC - EXPO projects

Added by Kasthuri G R over 1 year ago. Updated about 1 year ago.

Status:

Assigned

Priority:

Normal

Assignee:

Kasthuri G R

Target version:

Start date:

03/04/2024

Due date:

04/30/2024 (about 15 months late)

% Done:

Estimated time:

Description

Take input words from Hope and generate sentences from them using large language models (LLMs), keeping to basic, commonly used words.

Files

UI_1.png		Kasthuri G R, 03/20/2024 06:41 PM
UI_2.png		Kasthuri G R, 03/20/2024 06:41 PM
ThankYou.png		Kasthuri G R, 03/21/2024 06:38 AM
Classroom.png		Kasthuri G R, 03/21/2024 06:38 AM
Nice_to_meet.png		Kasthuri G R, 03/21/2024 06:39 AM
Sorry.png		Kasthuri G R, 03/21/2024 06:39 AM
2.keywords-dog-play-ball.png		Kasthuri G R, 05/30/2024 08:36 PM
1.keywords-dog-play-ball-happy.png		Kasthuri G R, 05/30/2024 08:36 PM
3.keywords-I-sleepy-hungry.png		Kasthuri G R, 05/30/2024 08:36 PM
4.keywords-I-pain-sad.png		Kasthuri G R, 05/30/2024 08:36 PM
5.keywords-sorry-helpme.png		Kasthuri G R, 05/30/2024 08:36 PM
6.keywords-we-sorry.png		Kasthuri G R, 05/30/2024 08:36 PM

Updated by Kasthuri G R over 1 year ago

Google has released an open AI model, Gemma (https://blog.google/technology/developers/gemma-open-models/). With it, I am able to generate sentences for the given input words.

The Google Sheet linked below lists the input words and the corresponding results generated by the Gemma model:

https://docs.google.com/spreadsheets/d/1CluoU2D38b6nuRMt4AJDTpU6_GFoB0e3VcQOo0_QJ0Y/edit?usp=sharing

This is an open-source model and it gives the expected results. We will have to design the prompts to get better results based on user needs. I will set a deadline based on the list of features we decide on.

Updated by Kasthuri G R over 1 year ago

To get input from special educators, I made a sample UI in Python in which the user can upload an image and the tool generates sentences for that image. The functionality is implemented. The UI still needs a better background color and alignment.

Updated by Kasthuri G R over 1 year ago

File UI_1.png added
File UI_2.png added

I made a sample UI that accepts an input image and generates sentences. The model output varies, as shown below. I am trying to figure out a way to filter the output.

For example, given the prompt "construct 5 basic proper sentences about {input word} and its habits for primary school students":

AI output

Step 1/5
1. Parrots are beautiful birds that come in many different colors and sizes.

Step 2/5
2. Parrots are very smart and can learn many tricks and even talk.

Step 3/5
3. Parrots are social birds and love to be around other parrots and people.

Step 4/5
4. Parrots need a lot of attention and exercise to stay healthy.
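
One possible way to filter output like the sample above is a small post-processing step that drops the "Step n/5" lines and strips the leading list numbers. This is only a sketch: the function name `filter_sentences` and both regular expressions are illustrative assumptions based on this one sample, and other model outputs may need different rules.

```python
import re
from typing import List

# Assumed format, taken from the sample output: lines such as "Step 1/5".
STEP_LINE = re.compile(r"^\s*Step\s+\d+\s*/\s*\d+\s*$", re.IGNORECASE)
# Leading list numbers such as "1. " or "2) ".
LEADING_NUMBER = re.compile(r"^\s*\d+\s*[.)]\s*")


def filter_sentences(raw_output: str) -> List[str]:
    """Return only the sentences from raw model output (hypothetical helper)."""
    sentences = []
    for line in raw_output.splitlines():
        line = line.strip()
        # Skip blank lines and "Step n/m" marker lines.
        if not line or STEP_LINE.match(line):
            continue
        # Remove the list number in front of the sentence, if any.
        sentence = LEADING_NUMBER.sub("", line)
        if sentence:
            sentences.append(sentence)
    return sentences
```

Applied to the sample output, this would leave just the four parrot sentences, which could then be shown in the UI.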

Updated by Anil Prabhakar over 1 year ago

Tracker changed from Bug to Feature
Project changed from 16 to AAC
Due date set to 04/30/2024
Status changed from New to Assigned

There should be a Due Date for every task and an updated %Done. Cannot work without any deadlines.

Do we have a Creative Commons license to use these images?
How are you imagining the use case, is this set up by a teacher, and the child clicks on a button, or is the child supposed to upload an image?
This is not picture to speech (think AI into HOPE...as the subject of this Issue). I am expecting a communication tool, where a user selects 1 or more images from the KAVI-PTS library, AI generates a few possible sentences, and the child picks one of them.

Updated by Kasthuri G R over 1 year ago

File ThankYou.png added
File Classroom.png added
File Nice_to_meet.png added
File Sorry.png added

> Do we have a Creative Commons license to use these images?

I will use the Kavi Library images.

> How are you imagining the use case, is this set up by a teacher, and the child clicks on a button, or is the child supposed to upload an image?

This is a sample UI that we decided to create and show to special educators at Mira school to get their ideas on the requirements, so that we can plan the development in both Hope and Kavi-PTS based on their feedback.

The sample use cases I have considered are:

1. AI generates sentences for "Thank You", "Sorry", and "Nice to meet you" that children can use. These are the pictures under Quick talk in the Kavi Library.
2. Common sentences used by children in the classroom. Classroom is also a picture from our image library.

The respective AI responses for the above inputs are uploaded.

Updated by Anil Prabhakar over 1 year ago

Use the > to quote, otherwise your reply is mixed with my comment.
Sentences are quite ambiguous and will become frustrating. We may need more than one image to create a sentence. Let us try with 2 images.

Updated by Kasthuri G R about 1 year ago

When I tried to run the Python code that takes a single image as input and generates text, it failed because Hugging Face has updated its libraries, and the code needed to be updated to run the model without errors. I have fixed the library issue and the code is functional again.

Tomorrow I will modify the code to accept two images as input for generating text.

Updated by Kasthuri G R about 1 year ago

File 1.keywords-dog-play-ball-happy.png added
File 2.keywords-dog-play-ball.png added
File 3.keywords-I-sleepy-hungry.png added
File 4.keywords-I-pain-sad.png added
File 5.keywords-sorry-helpme.png added
File 6.keywords-we-sorry.png added

I made a UI to display the images. An image is selected or deselected by clicking on it. Once images are selected, clicking Generate sentences displays possible sentences built from the keywords (the image names).

I am sharing sample outputs of the sentences generated by the Gemma model.

Prompt used: "Generate sentences suitable for children using the all the key words. Make the sentences short, simple, and conversing. "

I still need to try different prompts and run different models to compare the results.
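
To make the comparison of prompts and models repeatable, the prompt construction could be separated from the model call, roughly as sketched below. The names `build_prompt` and `compare`, the "Key words:" suffix, and the idea of passing each model as a plain callable are assumptions for illustration, not the code behind the UI. The base prompt is quoted verbatim from this comment.

```python
from typing import Callable, Dict, List, Tuple

# Prompt quoted verbatim from the comment above.
BASE_PROMPT = (
    "Generate sentences suitable for children using the all the key words. "
    "Make the sentences short, simple, and conversing. "
)


def build_prompt(keywords: List[str], template: str = BASE_PROMPT) -> str:
    """Append the selected image names (keywords) to a prompt template."""
    cleaned = [k.strip() for k in keywords if k.strip()]
    if not cleaned:
        raise ValueError("select at least one image")
    # The "Key words:" suffix is an assumed format, not taken from the ticket.
    return f"{template.strip()} Key words: {', '.join(cleaned)}"


def compare(
    keywords: List[str],
    templates: Dict[str, str],
    models: Dict[str, Callable[[str], str]],
) -> Dict[Tuple[str, str], str]:
    """Run every template against every model and collect the raw outputs."""
    results = {}
    for template_name, template in templates.items():
        prompt = build_prompt(keywords, template)
        for model_name, generate in models.items():
            # `generate` is any function mapping a prompt string to model text.
            results[(template_name, model_name)] = generate(prompt)
    return results
```

Each entry in `models` would wrap one real model (Gemma or another), so the same keyword sets, for example dog, play, ball, can be run against every prompt and model and the outputs reviewed side by side.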

Updated by Anil Prabhakar about 1 year ago

Think about your conversation with anyone. When you are trying to communicate with your son, will the prompt be "generate suitable sentence"?

Don't be a programmer, put more thought into how you will use GenAI to build an AAC solution.
