Hello, it has been a while since I updated this blog. I’ve been a little busy with college stuff and things like that. And finally, I have come to the last year of my graduate study. After consulting some professors at my college, I settled on a research focus. Actually, it is still at the proposal stage, but I hope this will work, because so many people are counting on me.
So, I wanna implement MapReduce to optimize processing in automatic part-of-speech tagging (POS tagging). POS tagging is the process of assigning a word class (noun, verb, adjective, and so on) to each word in a collection of text documents. To make the process automatic, we can use approaches built on natural language processing techniques. Some of these approaches involve supervised learning, which means we need to train a model on a tagged corpus before we use it to tag real-world text. We can use MapReduce to speed up both the training and the actual tagging.
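To make the idea a bit more concrete, here is a minimal single-machine sketch of how the training side could be phrased as a map step and a reduce step. It is not the actual research code, just one simple supervised approach (a most-frequent-tag baseline). The toy corpus, the tag names, and all function names are made up for illustration, and a real job would run on a MapReduce framework rather than in one Python process.

```python
from collections import defaultdict
from itertools import chain

def map_tagged_sentence(sentence):
    """Map step: emit ((word, tag), 1) for every token in one tagged sentence."""
    for word, tag in sentence:
        yield (word.lower(), tag), 1

def reduce_counts(pairs):
    """Reduce step: sum the counts emitted for each (word, tag) key."""
    totals = defaultdict(int)
    for key, count in pairs:
        totals[key] += count
    return dict(totals)

def most_frequent_tag(counts):
    """Build a baseline model: for each word, keep the tag seen most often."""
    best = {}
    for (word, tag), n in counts.items():
        if word not in best or n > best[word][1]:
            best[word] = (tag, n)
    return {word: tag for word, (tag, _) in best.items()}

# Hypothetical toy corpus: each sentence is a list of (word, tag) pairs.
corpus = [
    [("The", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
    [("The", "DET"), ("dogs", "NOUN"), ("bark", "VERB")],
    [("A", "DET"), ("bark", "NOUN"), ("peels", "VERB")],
    [("Dogs", "NOUN"), ("bark", "VERB")],
]

# Each sentence can be mapped independently, which is what makes the
# training step easy to spread across many machines.
emitted = chain.from_iterable(map_tagged_sentence(s) for s in corpus)
counts = reduce_counts(emitted)
model = most_frequent_tag(counts)

def tag_sentence(words, model, default_tag="NOUN"):
    """Tag new text; unknown words fall back to an assumed default tag."""
    return [(w, model.get(w.lower(), default_tag)) for w in words]

tagged = tag_sentence(["The", "dogs", "bark"], model)
```

Tagging itself is also a natural map step, since each sentence can be tagged without looking at any other sentence.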
Since this is my first time dealing with (yeah) MapReduce and natural language processing, I feel a little bit anxious. My anxiety is even taking over my excitement already. Hearing this, maybe you’ll ask how come I feel more anxiety than excitement. The answer is “I don’t know”, but I hope this will work out and I can finish the research on time. Oh, maybe it’s because there is a time variable. Well, if we didn’t have a time variable, then when would we start doing the work?
Well, this is just me rambling. Thank you to all the readers who have left questions, comments, and anything else on this blog. I hope we can keep in touch. Wish me luck. I’ll write about my research little by little on this blog. So, stay tuned. And let’s get started!!
Here is the situation. You are going to give an important presentation at an international conference. You have made your slides, and it’s like the greatest presentation in the universe. You made it using the latest version of Microsoft PowerPoint or OpenOffice.org Impress. You double-checked your presentation and your laptop right before your talk. Suddenly, out of nowhere, your laptop crashes, throws an error, shows a BSOD, or anything. You have no choice: you transfer your slides to another computer that can run a presentation. Unfortunately, that computer doesn’t have a program that can open your slides. It has an older version of PowerPoint that can’t open them. Or maybe it uses another operating system that your presentation program doesn’t support. You panic and can’t think clearly. Everything goes dark and suddenly you pass out.
About a year ago, I started an English conversation club with my friends from college. Well, it wasn’t a real conversation club actually. Instead of talking face to face, we used instant messenger or IRC. Later, I realized that I was doing it out of my own selfishness. I did it because I was lonely at home, while my friends were in another city. As expected, the club didn’t last. My friends became busier with their jobs and I couldn’t control them. Maybe they were eager to learn English, but they had their own business to take care of.
Now in graduate school, I found that my classmates are also trying to run an English conversation club. The great thing about it is that they are eager to learn how to speak, and we can do it face to face. We meet on Sunday mornings under a tree in the campus park. I have come twice and I’m starting to love this club. And last time, we talked about my favorite topic: blogs.
When I was in college, I tried to implement Web Map Service (WMS) and Web Feature Service (WFS) as a foundation for a distributed Geographic Information System (better known as GIS). My academic advisor at that time told me that this idea was not entirely new, but that a lot of people still didn’t know about it. So with this topic as my thesis, he hoped that one day people would know about this technology.
The implementation that I made was quite simple actually. But let me tell you the complete story. At first, I was thinking about developing geographical operations that could be run via web services in the cloud. After some weeks of analyzing and gathering information, I found out that this work could be really hard and time consuming. I didn’t have a background in geography (I’m a computer science student) and I didn’t have much time before the next graduation. In the end, I just created a spatial data repository and made it accessible across the network using GeoServer, an open source implementation of WMS and WFS. Then I created a simple web application to pull the spatial data and display it in the browser. I also provided a simple data update feature, utilizing one of the features of WFS. I used OpenLayers to create the application. It’s really simple actually.
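For readers who have never seen WMS or WFS, here is a rough sketch of the kind of HTTP requests a map client sends to a server like GeoServer. The original application used OpenLayers in the browser; this is only an illustration in Python, not the thesis code. The base URLs and the layer name `demo:roads` are hypothetical, and the query parameters follow the standard WMS 1.1.1 `GetMap` and WFS 1.0.0 `GetFeature` key-value requests.

```python
from urllib.parse import urlencode, urlparse, parse_qs

def wms_getmap_url(base_url, layer, bbox, width, height,
                   srs="EPSG:4326", image_format="image/png"):
    """Build a WMS 1.1.1 GetMap URL that asks the server for a rendered image."""
    params = {
        "SERVICE": "WMS",
        "VERSION": "1.1.1",
        "REQUEST": "GetMap",
        "LAYERS": layer,
        "STYLES": "",  # empty value means the layer's default style
        "SRS": srs,
        "BBOX": ",".join(str(v) for v in bbox),  # minx,miny,maxx,maxy
        "WIDTH": width,
        "HEIGHT": height,
        "FORMAT": image_format,
    }
    return base_url + "?" + urlencode(params)

def wfs_getfeature_url(base_url, type_name, max_features=10):
    """Build a WFS 1.0.0 GetFeature URL that asks for the raw features."""
    params = {
        "SERVICE": "WFS",
        "VERSION": "1.0.0",
        "REQUEST": "GetFeature",
        "TYPENAME": type_name,
        "MAXFEATURES": max_features,
    }
    return base_url + "?" + urlencode(params)

# Hypothetical server address and layer name, for illustration only.
WMS_BASE = "http://localhost:8080/geoserver/wms"
WFS_BASE = "http://localhost:8080/geoserver/wfs"

map_url = wms_getmap_url(WMS_BASE, "demo:roads", (-180, -90, 180, 90), 800, 400)
feature_url = wfs_getfeature_url(WFS_BASE, "demo:roads", max_features=5)

# Parse the query strings back to check what would be sent.
map_query = parse_qs(urlparse(map_url).query)
feature_query = parse_qs(urlparse(feature_url).query)
```

The difference between the two services shows up in the response: WMS returns a picture of the map, while WFS returns the features themselves, which is why an update feature has to go through WFS.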
In my graduate study, right now, I want to try something entirely different. I want to explore MapReduce, a programming model for processing large-scale data in a distributed environment. I heard about this model from some mailing lists and websites, and I was surprised that the paper [pdf], the lecture notes and the videos are easy to get. So, for the time being, I decided to do some experiments in order to learn something about it.
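If you have never seen the model before, the core idea fits in a few lines. The sketch below imitates it on a single machine with the classic word count example: a map function turns each input record into key-value pairs, the pairs are grouped by key, and a reduce function combines the values for each key. The function names here are my own for illustration and do not come from any particular framework. A real system would run the map and reduce calls on many machines in parallel.

```python
from collections import defaultdict

def run_mapreduce(records, mapper, reducer):
    """Run map, group-by-key (shuffle), and reduce in a single process."""
    groups = defaultdict(list)
    for record in records:
        # Map phase: each record is handled independently of the others.
        for key, value in mapper(record):
            # Shuffle phase: collect all values that share a key.
            groups[key].append(value)
    # Reduce phase: each key is handled independently of the others.
    return {key: reducer(key, values) for key, values in groups.items()}

def word_mapper(line):
    """Emit (word, 1) for every word in a line of text."""
    for word in line.split():
        yield word.lower(), 1

def sum_reducer(key, values):
    """Add up all the counts emitted for one word."""
    return sum(values)

lines = ["the quick brown fox", "the lazy dog", "The fox"]
word_counts = run_mapreduce(lines, word_mapper, sum_reducer)
```

Because no map call depends on another map call, and no reduce call depends on another reduce call, the work can be split across as many machines as are available.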
It’s still just a plan in my head actually. I haven’t talked about it with my thesis advisor (because I don’t have one yet). But I can predict some problems that I will have to deal with if I go ahead with this research plan. They are:
- The case. I don’t have any idea what case I should solve with this research. My college advisor suggested doing something in bioinformatics, like genome assembly. I think I will consider it. But I’m open to ideas.
- The machines and their network. The lab is always busy with other graduate students. Fortunately, a friend of mine told me that there is another place on campus that I can use for experiments. But I need to write a permission letter first. Okay, I’ll do it.
In the meantime, I’ll focus on learning about MapReduce. Maybe I’ll post something about it on this blog. If you have a suggestion about what I should do with this programming model, let me know. I’d be really glad to hear it.
EDSAC pictures, copyrighted Computer Laboratory, University of Cambridge, licensed under the Creative Commons Attribution 2.0 Generic license.
It has been a long time since my last post on this blog. I just wrote some posts on my old blog when suddenly it’s already 2010. Wow, time really waits for no one. So in this post I wanna sum up what happened over the last year. Well, I didn’t actually do anything big, but I made some progress in life, I guess.
Here I come. I spent the first months of 2009 back home. I was recovering from my illness and successfully gained some weight. I made some themes for this blog, explored domains and paid hosting, and maintained my virtual social life on Facebook and Twitter. Anything that can be done in the house, I’ve done it. Oh yeah, I also created a simple blog aggregator, usually called a planet, to list all of my class’s blog posts.