+ All Categories
Home > Documents > Final report SIP-Himanshu

Final report SIP-Himanshu

Date post: 15-Apr-2017
Category:
Upload: himanshu-bhatia
View: 44 times
Download: 0 times
Share this document with a friend
26
FINAL REPORT 1 FINAL REPORT Student Name: BHATIA HIMANSHU VINODKUMAR Enrollment Number: 1011517034 Specialization: DECISION SCIENCE Report Duration: 4 th April, 2016 to 19 th June, 2016 Schemopedia.org Company Name: SCHEMOPEDIA.ORG Mentor Name: MR. ABHISHEK RATHI Submitted to, Prof. Chinmay Baxi
Transcript
Page 1: Final report SIP-Himanshu

FINAL REPORT 1

FINAL REPORT

Student Name: BHATIA HIMANSHU VINODKUMAR

Enrollment Number: 1011517034

Specialization: DECISION SCIENCE

Report Duration: 4th April, 2016 to 19th June, 2016

Schemopedia.org

Company Name: SCHEMOPEDIA.ORG

Mentor Name: MR. ABHISHEK RATHI Submitted to,

Prof. Chinmay Baxi

Page 2: Final report SIP-Himanshu

FINAL REPORT 2

ACKNOWLEDGEMENT

I would like to express my gratitude to all. I want to thank SBS Ahmedabad

for granting me the permission to commence this project.

I am deeply indebted to my supervisors Prof. Chinmay Baxi (Faculty

guide) and Mr. Abhishek Rathi and Mr. Anand Trivedi (Company Guide) for

their help, stimulating suggestions and encouragement in all the time of

internship.

Especially, I would like to thank my parents and friends for extended me

their moral support.

Page 3: Final report SIP-Himanshu

FINAL REPORT 3

TABLE OF CONTENT

TITLE PAGE

NO.

Introduction to company 04

Vision & Potential Revenue streams 05

Problems 06

Project Detail & Objectives 07

Task Assigned & Strategy 08

Mid-Course Correction 13

Limitation 15

Learnings 16

Conclusion 17

Annexure-1- Revised BIBLE OF CODE 18

Annexure-2- PREPARED GUIDELINES &

CHECKLIST FOR SCHEME LISTING & CLEANING 20

Page 4: Final report SIP-Himanshu

FINAL REPORT 4

INTRODUCTION TO COMPANY

Schemopedia.org

Name of the Organization: - Schemopedia.org

Main activity of the Organization: Building Government to Citizen (G2C)

portal for scheme information.

Name & Designation of Head of the Organization:

Mr. Anand Trivedi and Mr. Abhishek Rathi – Co-founders.

Email ID:

[email protected] & [email protected]

“Schemopedia” is a start-up company which is preparing a portal for the Indian

citizens for create awareness about the schemes of the government as well as

Non-Government.

Schemopedia is a one-stop platform for facilitating simplified information access

and convenient application to all schemes / benefits extended to citizens in their

capacity as an individual or an enterprise owner.

Schemopedia, within the umbrella concept of Governity, is the first step towards

bridging this information gap. Governity, as a whole is the concept through which

we envision advanced engagement levels between Government and Citizens.

Schemopedia is collecting data of schemes, so that they can cover each and every

scheme which are beneficial for the Indian citizens.

Page 5: Final report SIP-Himanshu

FINAL REPORT 5

VISION

To constantly innovate and build open source technology platforms that facilitate

interaction between Governments, Citizens and Development Agencies to

improve the flow of information, entitlements and services as well as duties and

obligations.

POTENTIAL REVENUE STREAMS

Benefit Access / Scheme application fees - different rates for individuals /

enterprises

Subscription fees from Government / Policy makers for analytics use

Paid campaigns & surveys for the Government

Paid Participatory scheme design / budgeting exercises for elected

representatives

Page 6: Final report SIP-Himanshu

FINAL REPORT 6

CURRENT SCENARIO BETWEEN GOVERNMENT & CITIZEN

Current Scenario between Government & Citizen

PROBLEMS

Very limited technology used in enabling above interaction on the part of

the Government

Information spread across 500+ Govt. websites and multiple offline

sources and local language support, non-friendly interface, etc.

Low citizen awareness leading to even lower benefit access leading to

hundreds of underutilized schemes / programs

Process complexity etc.

All in all, lack of Citizens’ “pull” causing complacency in Govt. reach out

Page 7: Final report SIP-Himanshu

FINAL REPORT 7

PROJECT DETAILS

Data mining, marketing planning and Implementation - Enabling the

Schemopedia launch

OBJECTIVES

To build the complete database of schemes from various government

Ministries and Non-government websites.

To check and confirm no single scheme missing from those websites.

To confirm all listed schemes are in required format of Schemopedia.org

To compile all listed schemes database in single sheet.

To build complete product database of Schemopedia.org by completing all

above objectives

Page 8: Final report SIP-Himanshu

FINAL REPORT 8

TASK ASSIGNED AND STRATEGY

To achieve all above mentioned objectives we have taken four steps

below

Database Creation (Scheme listing)

Data Verification (Data Validation)

Data Cleaning

Data Compiling

Analysis of the Bill Order Quantity data of Gubbi Group

Unique ID generation and cross-mapping for scheme listing

Phase 1 - Database Creation

In the initial stage of my internship I have to analyze the portal as user interface.

In that procedure I have done the analysis of the portal by checking scope of

improvement of the portal and to list down the advantages & disadvantages of the

portal. I have to list down the schemes from various ministries of Central

Government. As to create the database for schemopedia.org, the scheme listing is

the base for the product (Schemopedia.org)

Page 9: Final report SIP-Himanshu

FINAL REPORT 9

Process Followed for Finding Schemes

GOI Directory (goidirectory.nic.in) which includes various ministries select

among them

For e.g. Ministry of New and Renewable Energy is elaborated below

GOI Directory – Ministries - Ministry of New and Renewable Energy includes

various departments like- Ministry, Autonomous / Apex Institutions, PSUs / Joint

Ventures, Companies & Schemes/Programs & Missions.

The Scheme listing procedure to entry into the structured format which is similar

to the portal of Schemopedia.org

The procedure follows with profile filter, purpose, domain and scheme details.

For entering profile filter the bible of code is prepared as reference.

Revised Bible of Code (given in Annexure 1)

Guidelines, Set of Instructions & Checklist for Scheme listing &

cleaning (given in Annexure 2)

Government of India Web Directory

Union Government

Autonomus/ Apex Institutions

PSUs / Joint ventures

Page 10: Final report SIP-Himanshu

FINAL REPORT 10

SCHEME LISTING PROCEDURE

Read & understand the Scheme

Go through Guidelines, checklist & set of instructions

Find out the purpose of the scheme?

Who is beneficiary?

What is the eligibility of the beneficiary?

What benefit is availed to the beneficiary?

Scheme listing from various Ministries of Central Government -

done by me are as follows:

Ministry Name Entries

1. Ministry of Civil Aviation 150

2. Ministry of Culture 215

3. Ministry of Environment, forest and climate change 085

4. Ministry of Power 050

5. Ministry of Tourism 015

6. Ministry of Rural Development 050

Scheme listing of various Ministry from Central government.

Analysing the portal as User Interface

Prepared Guidelines & Checklist for the interns and employees

Data Cleaning

Data analysis

Page 11: Final report SIP-Himanshu

FINAL REPORT 11

Training the interns for Scheme listing & Cleaning

Prepared exhaustive list of schemes for uploading

Phase 2 - Data Validation

1. DATA VALIDATION: Data Validation is the process of ensuring that a

programme operates on correct and useful data. Data validation ensure

checking for correctness, meaningfulness and security of data that are input

to the system. Data validation checks that data are valid, sensible, secure,

and reasonable before they are processed and also check the errors like

flow / logical errors.

IMPORTANCE OF DATA VALIDATION

Incorrect data validation can lead to data corruption or security

exposure.

Meaningless, incorrect data give wrong output in analysis.

Data validation helpful in minimizing the data cleaning process.

2. DATA VERIFICATION: Data Verification is the process of ensuring

that the data collected from the right source or location from where we

really need and check the listed schemes are really correct or not. It makes

data correction which is misunderstood by the time of data listing and helps

to resolve the errors.

IMPORTANCE OF DATA VERIFICATION

Data correction

Helpful in getting Missing data from the source.

Page 12: Final report SIP-Himanshu

FINAL REPORT 12

PHASE 3: - DATA CLEANING AND COMPILING

Data cleaning here refers to the final cleaning of created database, as to upload

on the portal of Schemopedia.org.

Data Cleaning refers to detecting and correcting the errors and to validate it for

further verification according to the rules and regulation.

In this, Phase-1 have to carry out the following works:

Compilation of schemes done by all interns

To check mapping of schemes is done or not

To clean the Scheme according to the checklist prepared

To fill up the empty cell

To check each filter according to revised bible of code

I have trained the other interns of Schemopedia.org for the Scheme listing and

cleaning.

I have prepared the exhaustive list of Schemes from 3025 entries – All 3025

entries are cleaned and ready for uploading on portal of Schemopedia.org

OTHER WORKS AS FOLLOWS:

Exercise conducted for Learning data analysis using excel tool: Data set from

a survey conducted among newspaper readers on the perception and feedback

of a daily supplement of a Gujarati newspaper - City Bhaskar. It is a

supplement of Divya Bhaskar newspaper.

Page 13: Final report SIP-Himanshu

FINAL REPORT 13

MID-COURSE CORRECTION

PHASE 1 - DATABASE CREATION: -

In this phase we were surfing on internet for various schemes and listing it after

analyze the scheme form various government and Non-government websites.

That listed file than checked by superior. While checking that file they found

some basic problems like grammatically errors and also some missing content so

they give it back to us to resolve those errors.

In the initial period frequency of errors was quite more but then after day by day

with having concentrated mind and because of practice it is minimized in such a

way that they just checked and make it confirm instantly.

PHASE 2 - DATA VALIDATION: -

In this phase the work done in 1st phase by individual is altered with colleagues

so that cross check can be done by this missing content and schemes can be find

out easily.

It is found that some work debate origin in this phase due to having some pointing

out errors but it was obvious and thus we could realize that is actual errors in

some case and make it correct at that time.

Page 14: Final report SIP-Himanshu

FINAL REPORT 14

PHASE 3 - DATA CLEANING AND COMPILING

In this phase there was maximum work concentration required because it was the

period when all work assembled and cleaning process took place on that. Thus

we found that though after giving 100% work efficiency we couldn’t found

accuracy required by superior and always superior give that work back to us for

doing again.

But having very good coordination among team and having good team effort we

make it possible. So, finally we made whole product ready in the last week of our

internship.

Page 15: Final report SIP-Himanshu

FINAL REPORT 15

LIMITATION

There were few limitations also there during work period of internship which is

mentioned below in brief.

Internet speed issues

Data transferring issues

Data missing issues

Lack of inbuilt Software

Lack of supervisor assistants

Non availability of PC’s

Page 16: Final report SIP-Himanshu

FINAL REPORT 16

LEARNING’S

Learnt to do Data mining from unstructured data.

Cultivated habit of Critical thinking while preparing guidelines, checklist

& questionnaire.

Good grip on various functions of Microsoft excel.

Time and Stress management: - Learnt to prioritize the task according to

time (deadline)

Learnt to remove the errors while cleaning the data

How to co-ordinate with other interns to complete the task

Page 17: Final report SIP-Himanshu

FINAL REPORT 17

CONCLUSION

Internship at Schemopedia for 11 weeks started from 04th April 2016 to 19th June

2016 has been a good learning experience for me. During this period, I came

across a number of good people, in network who helped me and also knowledge

sharing with them would be treasure for me.

Exposure to corporate sector during my internship had helped me to understand

the art of balancing the task and time. So that I could manage with target and

mind stress during my internship.

Experience in a corporate sector, is a good exposure about what really the

corporate life is about. The person who can manage these target and stress can

survive in these corporate world. The only thing required in corporate world is

the right attitude or a positive attitude towards attaining one’s goal.

Page 18: Final report SIP-Himanshu

FINAL REPORT 18

ANNEXURE 1 – REVISED BIBLE OF CODE Revised Bible of Code

1st Profile 2nd Profile Filter 3rd profile filter 4t 5th

Filter

Enterprise

Area of business Type of Implementing Depends

upon the

profile

Manufacturing Co-operative Society

Miscellaneous

s

Services SHG

Infrastructure Large

Institute/ Research Labs Medium

Trading Small

Agriculture Micro

Association Start up

NGO Professional

No restriction Private developer

SPV of Industries

R&D Institutes

Association Bodies

Academic Institutes

Joint Liability Organization

Family

Caste / Category type Family Characteristics

APL Family Income less

BPL Family Size

EBC No restriction

SEBC

No restriction

Schedule Caste

Schedule Tribe

Neo middle class families

(Tribal APL families with

income limit of Rs. 2 (Non

Income Tax paying) Other

Backward Castes

(Notified OBC) - DE

notified

Individual

classesCaste/ Category

type

Specification

APL Gender (Male / Female)

BPL Age

Page 19: Final report SIP-Himanshu

FINAL REPORT 19

EBC No restriction

SEBC

No restriction

Schedule Caste

Schedule Tribe

Minorities

Physically & Mentally

Notified - DE notified

Other Backward Castes

Please note that in case of Individual, "(OBC) Specification" only means Age OR

Gender. If an

entry says "Students", it will be covered in 4th / 5th filter

Page 20: Final report SIP-Himanshu

FINAL REPORT 20

ANNEXURE 2 - PREPARED GUIDELINES & CHECKLIST FOR SCHEME LISTING & CLEANING

Guidelines

Read the scheme thoroughly

Understand the scheme – Find out the following from the scheme - What is the

objective of the scheme?

From whom the scheme is for?

What are the eligibility (required criteria for getting benefit of the scheme)? -

What benefit is given to the eligible applicant?

Profile filters – 1st – read the scheme and decide who is the beneficiary from the three options

below: - for e.g. - Sample file – Scheme no 1

Individual

Family

Enterprise

Profile filters – 2nd & 3rd – depends upon the scheme and must be relevant to 1st profile filter

– for e.g. Sample file – Scheme no 1

Take reference of Bible of code

In case the scheme includes the detail of BPL / APL and caste in this scenario the

multiple entries will take place

Reviewing ministry webs ite along with all its departments / corporations

Estimate the number of schemes to be listed

Make daily plan for entire ministry and estimate no. of days of work

Start Scheme listing as per the guidelines / checklist given

Spell Check / Grammar Check for the entire entry / set of entries

Cross verification of the entries with the link for missing columns / data

Final mapping with the guidelines for corrections

Page 21: Final report SIP-Himanshu

FINAL REPORT 21

Profile filters – 4th & 5th – This is named as miscellaneous and depend upon the scheme

How to enter “By Purpose” - for e.g. Scheme no 1 to 5 of the Sample file.

Page 22: Final report SIP-Himanshu

FINAL REPORT 22

The description of “By Purpose” column must include in a single sentence, the following things about the

scheme – o What Benefit is being given (e.g. Financial Assistance),

In what form (e.g. in the form of Grant),

for what purpose / reason (e.g. for purchase of hand weaving machinery) - Who is

the beneficiary (e.g. to BPL artisans)?

So, “Financial assistance in the form of Grant for the purchase of hand weaving machinery to BPL artisans”

Keep by purpose in short and simple language.

In case of multiple entries of single schemes with same set of initial filters and similar

looking “By Purpose” and “Scheme Title, use * for highlighting

Only the differentiated word must be coded between * for highlighting the difference of

the multiple entries

How to enter Domain – For e.g. Scheme no 4 in the sample file

Domain must be from the list of domains as visible on the Portal’s main page

All possible options after going through the scheme need to be mention in domain

separated by comma e.g. Education, Labour and Employment

Master UID

Give the name as follows: - Department Name_Guj / Eng_001_State name

Unique number Sr.no – 1,2,3,4…….

Central / State

Sponsors of the scheme (Any one of the given either central or state)

State - for e.g. Scheme no 4 of Sample file

State name to be entered if you are listing Schemes of State Government

In Central Government Schemes, if the states / region are mentioned then those states /

region to be mention in state column otherwise enter “All States”

Department Name to be showcased in the portal

Department name must be the one from where you got the scheme

Sub department name_1 & 2 - for e.g. Sample file

If sub-department is given, then enter it in column sub – department _1

If there is further classification of sub – department_1, then enter that sub- department

in column named sub- department _2

Scheme title to be displayed - For e.g. – Scheme no 2 & 3 of the sample file.

Scheme title must be similar to the scheme purpose

Page 23: Final report SIP-Himanshu

FINAL REPORT 23

In case the scheme name is different & unable to understand easily – we are supposed to

enter the scheme name & then the purpose of it to give it a proper title - Keep the Scheme

Title in short and simple language.

The “Scheme Title” column must include in a single sentence, the following things about

the scheme –

What Benefit is being given (e.g. Financial Assistance),

In what form (e.g. in the form of Grant),

For what purpose / reason (e.g. for purchase of hand weaving machinery)

Who is the beneficiary (e.g. to BPL artisan)

So, “Financial assistance in the form of Grant for the purchase of hand weaving machinery to BPL artisans”

In case of multiple entries of single schemes with same set of initial filters and similar

looking “By Purpose” and “Scheme Title, use * for highlighting

Only the differentiated word must be coded between * for highlighting the difference of

the multiple entries

Eligibility - For e.g. Sample file

Eligibility criteria of the beneficiary to be entered

The eligibility must be as per profile filters also if NO eligibility criteria is given in scheme.

(for e.g. if the profile filter has said SC and eligibility on the website does not mention SC,

still we must write that “Beneficiary should be from SC category.)

In case the eligibility is mapped with benefits in tabular format, then form them in

sentence for listing and keep the order of entries same as those mentioned in table - In

case of multiple points # to be used.

Use “Alt + Enter” for moving to next line and keeping the entries well formatted - Enter

required eligibility criteria only

Beneficiary

The one who is getting the benefit of the scheme

Benefit type - the benefit type must be any of the following

Financial Assistance

Products

Services

Awards & Recognition

Description of Benefit - For e.g. Sample file – Scheme no 3

Page 24: Final report SIP-Himanshu

FINAL REPORT 24

Benefit of the scheme as given to be entered

Mention the main benefit right at the beginning and then the details later.

In case the benefit is given in tabular form then form them in sentence for listing and keep

the order of entries same as those mentioned in table while entering eligibility

In case of multiple points # to be used.

Use “Alt + Enter” for moving to next line Link – for e.g. Sample file

Link of the scheme to be entered in link column

If the link of scheme is in pdf form, then page number of the scheme to be entered

Conditionality

Conditionality is to be written which are related ONLY to the benefit and not regarding

any other aspect like application process, eligibility, etc. - In case of multiple points # to

be used.

Use “Alt + Enter” for moving to next line Annexure link

If the annexure is provided in the scheme, then enter the link of annexure with page number.

Possible Duration

As mentioned in the scheme the operative period of the scheme has to be entered

How to Apply

Enter the link of the application form and details

In case the scheme is in pdf form and having application form enter the page number of

application form and link with other details

In case of multiple points use # as bullet

Use “Alt + Enter” for moving to next line just before entering the link

Sponsoring Agency

Government / NGO - Any one of them to be entered.

Set of Instructions

When the beneficiary of the scheme is one which is government body, then NO LISTING

of that scheme is required.

When the scheme includes bidding process or tender based selection of the beneficiary,

then NO LISTING of that scheme is required.

Profile filters - In profile filters 1st three are mandatory and to be entered with reference

of bible of code.

Page 25: Final report SIP-Himanshu

FINAL REPORT 25

“By Purpose” - DO NOT simply copy paste the “By Purpose” description from the scheme

but think once if the same is relevant. In most cases, the same will be relevant but in some

cases, additions / deletions might be necessary. Use your discretion.

“By Domain” - If the scheme is leading to multiple domains, enter all in main domain

column separating by comma (,)

“State” - If no states are given in scheme of central government – enter All states - Scheme

title - DO NOT simply copy paste the “By Purpose” description to “Scheme Title to be

displayed” and think once if the same is relevant. In most cases, the same will be relevant

but in some cases, additions / deletions might be necessary. Use your discretion.

Eligibility – Cover the required points of the profile filters and ensure all the points start

with the bullet #

Description of benefit – Mention the main benefit right at the beginning and then the

details later. And ensure all the points to be bulleted with using # only

Page number of the link is must when the scheme is in pdf form

In cases wherever the benefit is “Training” or “Scholarship”, the duration or standard for

which it is applicable needs to be mentioned in the “Benefit

Description”

In some cases, the Website structure is such that, navigating back to the original scheme

becomes difficult even after clicking on the link. E.g.

http://rubberboard.org.in/Scheme.asp?Id=4

In such cases, kindly also mention in the link column – the process to reach the

appropriate scheme e.g. Click on _____ the below link AND THEN, Click on ______ to reach

the Scheme description.

Regarding Scheme Duration, if a scheme is mentioned to have been expired – (a) Search

Google to check if the scheme is currently running?

(b) If not found, then ENTER the scheme and we shall verify it during validation exercises to

the Government offices

Checklist for Cleaning of Data entries

Mapping of possible entries

Profile filter – reference bible of code

Easy understandable language and in short for by purpose, Scheme title, eligibility & description

of benefit.

Page 26: Final report SIP-Himanshu

FINAL REPORT 26

Scheme title: only specific schemes name to be added

In case of multiple entries of single schemes with same set of initial filters and similar looking “By

Purpose” and “Scheme Title, use * for highlighting” and # for bullet points in eligibility, description

of benefit & how to apply

Unnecessary space needs to be removed

Benefit type: - Financial assistance, Product, Services & Awards / recognition.

Page number of scheme to be provided in the link column (if file is in pdf form - too large with

multiple schemes)

NO cells are to be left EMPTY

Cross verification of entries with link

Spell check & Grammar Check the entire data sheet at the end of every day before saving the

sheet


Recommended