Augmented Reality is live view of the real world whose elements are supplemented/augmented by a computer. This concept is found in many games, one of which is PokemonGO.
In our exercise, we were tasked to augment an image to the chess board pattern and when the pattern is rotated, the image supplemented actually rotates alongside the pattern.
Tuesday, November 29, 2016
Exer11: Chroma Keying
In this exercise, we worked in groups to create a video mainly made by using chroma keying. Chroma keying is a way to detect a certain color in an image and replacing the detected color with another image/image sequence. This is commonly used in fantasy and sci-fi movies.
In our chroma keying video, we were assigned to copy and make our version of K-On's opening theme.
In our chroma keying video, we were assigned to copy and make our version of K-On's opening theme.
For the exercise, we used a dark green background.
Exer10: Color Tracking
In this exercise we were tasked to track an object using its color. To do so, we put a small, black rectangle in every frame of the image sequence (although not seen in the image lol). The object to be tracked will be put in the frame in a way that the black rectangle covers the object.
When a colored object (non-black) is put in the region of the rectangle, the rectangle will change to the color of the image. The colored object will then be tracked when moved in the image sequence.
Small, black rectangle is somewhere in there lol |
When a colored object (non-black) is put in the region of the rectangle, the rectangle will change to the color of the image. The colored object will then be tracked when moved in the image sequence.
Exer9: Background Subtraction
In this exercise, we were tasked to get the foreground pixels (image) in a video or sequence of images.
First, it was important that we get the background model of the image sequence. To get the background model, we took a number of frames in the image sequence, converted it to grayscale, and computed the median of the grayscale pixels.
Next, this background model was subtracted to every frame in the image sequence. The difference per pixel is then compared to the given threshold. If the difference is greater than the threshold, it is assigned as a foreground pixel. Else, it is a background pixel.
First, it was important that we get the background model of the image sequence. To get the background model, we took a number of frames in the image sequence, converted it to grayscale, and computed the median of the grayscale pixels.
Background Model |
Next, this background model was subtracted to every frame in the image sequence. The difference per pixel is then compared to the given threshold. If the difference is greater than the threshold, it is assigned as a foreground pixel. Else, it is a background pixel.
Monday, November 7, 2016
Exer8: Pseudo Coloring (False Coloring)
Exer7.5: Optical Character Recognition part 2
In this exercise, we were tasked to detect what characters are present in the given set of plate number images using tesseract.
Before using tesseract, the images used had to be preprocessed first to ensure that the tesseract only reads/detects the plate number and not the rest of the image.
Before using tesseract, the images used had to be preprocessed first to ensure that the tesseract only reads/detects the plate number and not the rest of the image.
Image 1
Region of Interest |
Image 1 Results:
Image 2
Region of Interest |
Image 2 Results:
Image 3
Region of Interest |
Image 3 Results:
This is done to all the given images. It can be seen that the results are not always right. The results that tesseract gives highly depends on how the region of interest is preprocessed.
Subscribe to:
Posts (Atom)