About the Current Project
DBpedia is a crowd-sourced community effort to extract structured content from the information created in various Wikimedia projects. This structured information resembles an open knowledge graph (OKG) which is available for everyone on the Web. A knowledge graph is a special kind of database which stores knowledge in a machine-readable form and provides a means for information to be collected, organised, shared, searched and utilised.
- The use of knowledge graph requires no time for training but only needs time of the graph construction process. The service is therefore capable of being online quicker.
- The service can check whether given words, set of words seperated by delimeter are valid ngrams or not. The service is also capable of checking validity of the part-of-speech(POS) of the words entered by the user.
- The service can extract valid N-gram from sentences,words or set of words entered by the user making the sentence grammatically correct.
Our research aims to bring a sentence similarity service which would measure the closeness of two or more sentence or paragraph using Natural Language Processing and WordNet
Dr. Maiga Chang is a Full Professor in the School of Computing and Information Systems at Athabasca University, Canada.
The research focuses on creating a service capable of verifying valid n-grams from a given set of words. The service is capable of extracting valid n-grams and their part of speech(POS) from the words provided by the user which can be used for verification purposes.
Rob Schmidt is an Athabasca University undergraduate student from Calgary, Alberta, Canada. He will be pursuing a master's degree in Computer Science and has a particular interest in game based design, learning and research.
Bhavesh Gandhi is an undergraduate student. He is pursuing Electrical and Electronics Engineering from Heritage Institute of Technology, India. His research interest lies in the domain of Machine Learning and Natural Language Processing.
- Stage - 1: Automated System to extract and store Valid N-grams and their POS tags from DBpedia.
- Stage - 2: Developing the API service.
Stage 1's major features include (but not limited to)
- To extract and store Valid N-grams and their POS tags from DBpedia.
- Cron jobs for the backend services.
- Dashboard that shows backend services' working progress.
Stage 2's major features include (but not limited to)
- Developing an API service.
- Using the stored N-grams and their POS make a service for users to get the desired information.
Frequently Asked Questions
What can be the service used for ?
The service can be used for extracting and validating N-gram and the most frequent POS(part-of-speech) tags.
What does the sentence similarity service does ?
The sentence similarity actually calculates the similarity between two sentences and assign a score to the overall result.
How the current service is different from the other services ?
The current service uses the valid N-gram learning service to filter out the sentences, which means if there is some word which makes no sense, then it will be remove from the sentence preserving the position of the words.The service can also be useful to extract and check the grammatical correctness of the sentences (upto 4-grams).
How can i use this service ?
Please go to the HOW-to section and find the necessary documentation for each of the service and how to use them.
Can there be frequent updates in the services ?
Yes, there can be updates in the service or a new service , the webpage will also be updated with the latest information of the new service.