Post on 14-Oct-2020
transcript
Unit-8
Current Developments
Compiled By:
Dr. Vanita Joshi
Unit-8 Currrent Development
1
Content
• E-business
• Cloud Computing
• Big data
• Wireless Sensor Network
• Data Warehousing
• Data Mining
• Green Computing
Unit-8 Currrent Development
2
Unit-8 Currrent Development 3
What is E-Commerce?
• Electronic commerce or e-commerce refers to a wide range of
online business activities for products and services. It also pertains
to “any form of business transaction in which the parties interact
electronically rather than by physical exchanges or direct physical
contact.”
• Business activities running over the Internet and World Wide Web
platform
• Use of TV and toll-free telephones in business are not usually
associated with e-commerce
Unit-8 Currrent Development 4
What is E-Commerce? (Continue…….)
• A more complete definition is: E-commerce is the use of
electronic communications and digital information processing
technology in business transactions to create, transform, and
redefine relationships for value creation between or among
organizations, and between organizations and individuals.
• Depending on the situation, electronic data interchange (EDI)
may or may not be associated with e-commerce. EDI is used for
business-to-business transactions
Unit-8 Currrent Development 5
Pure Vs. Partial Electronic Commerce
–Three dimensions
• the product (service) sold [physical / digital]; • the process [physical / digital] • the delivery agent (or intermediary) [physical / digital]
–Traditional commerce
• all dimensions are physical
–Pure EC
• all dimensions are digital
–Partial EC
• all other possibilities include a mix of digital and physical dimensions
Unit-8 Currrent Development 6
Is e-commerce the same as e-business?
e-commerce and e-business are generally used interchangeably, but they
are the distinct concepts.
In e-commerce, information and communications technology (ICT) is
used in inter-business or inter-organizational transactions (transactions
between and among firms/organizations) and in business-to-consumer
transactions (transactions between firms/organizations and individuals).
While in e-business, on the other hand, ICT is used to enhance one’s
business. It includes any process that a business organization (either a for-
profit, governmental or non-profit entity) conducts over a computer-
mediated network.
Unit-8 Currrent Development 7
Three primary processes are enhanced in e-business:
1. Production processes, which include procurement, ordering and
replenishment of stocks; processing of payments; electronic links with
suppliers; and production control processes, among others.
2. Customer-focused processes, which include promotional and
marketing efforts, selling over the Internet, processing of customers’ purchase orders and payments, and customer support, among others.
3. Internal management processes, which include employee services,
training, internal information-sharing, video-conferencing, and
recruiting.
Electronic applications enhance information flow between production
and sales forces to improve sales force productivity. Workgroup
communications and electronic publishing of internal business
information are likewise made more efficient.
Unit-8 Currrent Development 8
Types of E-Commerce
• Business-to-Business (B 2 B) e-commerce
• Business-to-Consumer ( B 2 C) e-commerce
• Consumer-to-consumer (C 2 C) e-commerce
• Business-to-Government (B 2 G) e-commerce
• Mobile Commerce (m-commerce)
Unit-8 Currrent Development 10
Business-to-Business (B 2 B) e-commerce
• B2B e-commerce is simply defined as e-commerce between
companies.
• This is the type of e-commerce that deals with relationships between
and among businesses. About 80% of e-commerce is of this type.
• The B2B market has two primary components: e-frastructure and e-
markets.
• The more common B2B examples and best practice models are IBM,
Hewlett Packard (HP), Cisco and Dell. Cisco, for instance, receives
over 90% of its product orders over the Internet.
Unit-8 Currrent Development 11
Business-to-Consumer ( B 2 C) e-commerce
• Business-to-consumer e-commerce, or commerce between
companies and consumers.
• B2C involves customers gathering information; purchasing
physical goods or information goods and receiving products
over an electronic network.
• It is the second largest and the earliest form of e-commerce.
Its origins can be traced to online retailing (or e-tailing). Thus,
the more common B2C business models are the online retailing
companies such as Amazon.com, Drugstore.com, Beyond.com.
Unit-8 Currrent Development 12
Consumer-to-Consumer (C 2 C) e-commerce
• Consumer-to-consumer e-commerce or C2C is simply commerce
between private individuals or consumers.
• This type of e-commerce is characterized by the growth of
electronic marketplaces and online auctions, particularly in vertical
industries where firms/businesses can bid for what they want from
among multiple suppliers. It perhaps has the greatest potential for
developing new markets.
Unit-8 Currrent Development 13
This type of e-commerce comes in at least three forms:
• Auctions facilitated at a portal, such as eBay, which allows
online real-time bidding on items being sold in the Web.
•Peer-to-peer systems, such as the Napster model (a protocol for
sharing files between users used by chat forums similar to IRC)
and other file exchange and later money exchange models.
• Classified ads at portal sites such as Excite Classifieds and
eWanted (an interactive, online marketplace where buyers and
sellers can negotiate and which features “Buyer Leads & Want
Ads”).
Unit-8 Currrent Development 14
• Business-to-government e-commerce is generally defined as commerce between companies
and the public sector.
• It refers to the use of the Internet for public procurement, licensing procedures, and other
government-related operations.
• This kind of e-commerce has two features: first, the public sector assumes a pilot/leading role
in establishing e-commerce; and second, it is assumed that the public sector has the greatest
need for making its procurement system more effective.
• Web-based purchasing policies increase the transparency of the procurement process (and
reduces the risk of irregularities). To date, however, the size of the B2G e-commerce market as
a component of total e-commerce is insignificant, as government e-procurement systems
remain undeveloped.
Business-to-Government (B 2 G) e-commerce
Unit-8 Currrent Development 15
Mobile Commerce (m-commerce)
• M-commerce (mobile commerce) is the buying and selling of
goods and services through wireless technology-i.e., handheld
devices such as cellular telephones and personal digital assistants
(PDAs).
• As content delivery over wireless devices becomes faster, more
secure, and scalable.
• Industries affected by m-commerce include:
* Financial services
* Telecommunications
* Service/retail
* Information services
Unit-8 Currrent Development 16
Components of a typical successful E-Commerce
E-commerce does not refer merely to a firm putting up a Web site for
the purpose of selling goods to buyers over the Internet. But
number of technical as well as enabling issues have to be
considered.
A typical e-commerce transaction loop involves the following major
players and corresponding requisites:
• Seller
• Transaction Partners
• Consumers
• Business / Firm
• Government
• Internet
Big Data Unit-8 Currrent Development
17
Introduction to Big Data
What is Big Data?
What makes data, “Big” Data?
18
Unit-8 Currrent Development
Big Data Definition
• No single standard definition…
“Big Data” is data whose scale, diversity, and complexity require new architecture, techniques, algorithms, and
analytics to manage it and extract value and hidden
knowledge from it…
19
Unit-8 Currrent Development
Characteristics of Big Data:
1-Scale (Volume)
• Data Volume
• 44x increase from 2009 2020
• From 0.8 zettabytes to 35zb
• Data volume is increasing exponentially
20 Exponential increase in
collected/generated data
Unit-8 Currrent Development
Characteristics of Big Data:
2-Complexity (Varity) • Various formats, types, and structures
• Text, numerical, images, audio, video,
sequences, time series, social media
data, multi-dim arrays, etc…
• Static data vs. streaming data
• A single application can be
generating/collecting many types of
data
21 To extract knowledge all these types of
data need to linked together
Unit-8 Currrent Development
Characteristics of Big Data:
3-Speed (Velocity) • Data is begin generated fast and need to be processed fast
• Online Data Analytics
• Late decisions missing opportunities
• Examples
• E-Promotions: Based on your current location, your purchase history, what
you like send promotions right now for store next to you
• Healthcare monitoring: sensors monitoring your activities and body
any abnormal measurements require immediate reaction
22
Unit-8 Currrent Development
Big Data: 3V’s
23
Unit-8 Currrent Development
Some Make it 4V’s
24
Unit-8 Currrent Development
Who’s Generating Big Data
Social media and networks
(all of us are generating data) Scientific instruments
(collecting all sorts of data)
Mobile devices
(tracking all objects all the time)
Sensor technology and networks
(measuring all kinds of data)
• The progress and innovation is no longer hindered by the ability to collect data
• But, by the ability to manage, analyze, summarize, visualize, and discover
knowledge from the collected data in a timely manner and in a scalable
fashion 25
Unit-8 Currrent Development
The Model Has Changed…
• The Model of Generating/Consuming Data has Changed
Old Model: Few companies are generating data, all others are consuming data
New Model: all of us are generating data, and all of us are consuming data
26
Unit-8 Currrent Development
What’s driving Big Data
- Ad-hoc querying and reporting
- Data mining techniques
- Structured data, typical sources
- Small to mid-size datasets
- Optimizations and predictive analytics
- Complex statistical analysis
- All types of data, and many sources
- Very large datasets
- More of a real-time
27
Unit-8 Currrent Development
Big Data Technology
28
Unit-8 Currrent Development
Could Computing Unit-8 Currrent Development
29
What is Cloud Computing? • Cloud Computing is a general term used to describe a new
class of network based computing that takes place over the
Internet,
• basically a step on from Utility Computing
• a collection/group of integrated and networked hardware,
software and Internet infrastructure (called a platform).
• Using the Internet for communication and transport provides
hardware, software and networking services to clients
• These platforms hide the complexity and details of the
underlying infrastructure from users and applications by
providing very simple graphical interface or API (Applications
Programming Interface).
30 Unit-8 Currrent Development
What is Cloud Computing?
• In addition, the platform provides on demand services, that are
always on, anywhere, anytime and any place.
• Pay for use and as needed, elastic
• scale up and down in capacity and functionalities
• The hardware and software services are available to
• general public, enterprises, corporations and businesses markets
31 Unit-8 Currrent Development
Cloud Summary • Cloud computing is an umbrella term used to refer to
Internet based development and services
• A number of characteristics define cloud data,
applications services and infrastructure:
• Remotely hosted: Services or data are hosted on remote
infrastructure.
• Ubiquitous: Services or data are available from anywhere.
• Commodified: The result is a utility computing model
similar to traditional that of traditional utilities, like gas and
electricity - you pay for what you would want!
32 Unit-8 Currrent Development
Cloud Architecture
33 Unit-8 Currrent Development
What is Cloud Computing
34 Adopted from: Effectively and Securely Using the Cloud Computing Paradigm by peter Mell, Tim Grance
• Shared pool of configurable computing resources
• On-demand network access
• Provisioned by the Service Provider
Unit-8 Currrent Development
Cloud Computing Characteristics
35
Common Characteristics:
Low Cost Software
Virtualization Service Orientation
Advanced Security
Homogeneity
Massive Scale Resilient Computing
Geographic Distribution
Essential Characteristics:
Resource Pooling
Broad Network Access Rapid Elasticity
Measured Service
On Demand Self-Service
Adopted from: Effectively and Securely Using the Cloud Computing Paradigm by peter Mell, Tim Grance Unit-8 Currrent Development
Cloud Service Models
36
Software as a Service (SaaS)
Platform as a Service (PaaS)
Infrastructure as a Service (IaaS)
Google App
Engine
SalesForce CRM
LotusLive
Adopted from: Effectively and Securely Using the Cloud Computing Paradigm by peter Mell, Tim Grance Unit-8 Currrent Development
SaaS Maturity Model
37
Source: Frederick Chong and Gianpaolo Carraro, “Architectures Strategies for Catching the Long Tail”
Level 2: Configurable per
customer
Level 3: configurable &
Multi-Tenant-Efficient
Level 1: Ad-Hoc/Custom – One Instance per customer
Level 4: Scalable,
Configurable & Multi-Tenant-
Efficient
Unit-8 Currrent Development
Different Cloud Computing Layers Application Service
(SaaS)
Application Platform
Server Platform
Storage Platform Amazon S3, Dell, Apple, ...
3Tera, EC2, SliceHost,
GoGrid, RightScale, Linode
Google App Engine, Mosso,
Force.com, Engine Yard,
Facebook, Heroku, AWS
MS Live/ExchangeLabs, IBM,
Google Apps; Salesforce.com
Quicken Online, Zoho, Cisco
38 Unit-8 Currrent Development
Services
Application
Development
Platform
Storage
Hosting
Cloud Computing Service Layers Description
Services – Complete business services such as PayPal,
OpenID, OAuth, Google Maps, Alexa
Services
Application
Focused
Infrastructure
Focused
Application – Cloud based software that eliminates the need
for local installation such as Google Apps, Microsoft Online
Storage – Data storage or cloud based NAS such as
CTERA, iDisk, CloudNAS
Development – Software development platforms used to
build custom cloud based applications (PAAS & SAAS) such
as SalesForce
Platform – Cloud based platforms, typically provided using
virtualization, such as Amazon ECC, Sun Grid
Hosting – Physical data centers such as those run by IBM,
HP, NaviSite, etc.
39 Unit-8 Currrent Development
Basic Cloud
Characteristics
• The “no-need-to-know” in terms of the underlying details of infrastructure, applications interface with
the infrastructure via the APIs.
• The “flexibility and elasticity” allows these systems to scale up and down at will
• utilising the resources of all kinds
• CPU, storage, server capacity, load balancing, and databases
• The “pay as much as used and needed” type of utility computing and the “always on!, anywhere and
any place” type of network-based computing.
40 Unit-8 Currrent Development
Basic Cloud
Characteristics
• Cloud are transparent to users and applications, they can be
built in multiple ways
• branded products, proprietary open source, hardware or software,
or just off-the-shelf PCs.
• In general, they are built on clusters of PC servers and off-the-
shelf components plus Open Source software combined with
in-house applications and/or system software.
41 Unit-8 Currrent Development
Software as a Service
(SaaS) • SaaS is a model of software deployment where an
application is hosted as a service provided to
customers across the Internet.
• Saas alleviates the burden of software
maintenance/support
• but users relinquish control over software versions and
requirements.
• Terms that are used in this sphere include
• Platform as a Service (PaaS) and
• Infrastructure as a Service (IaaS)
42 Unit-8 Currrent Development
Cloud Taxonomy
43 Unit-8 Currrent Development
Cloud Storage
• Several large Web companies are now exploiting the
fact that they have data storage capacity that can be
hired out to others.
• allows data stored remotely to be temporarily cached on
desktop computers, mobile phones or other Internet-linked
devices.
• Amazon’s Elastic Compute Cloud (EC2) and Simple Storage Solution (S3) are well known examples
• Mechanical Turk
44 Unit-8 Currrent Development
Amazon Simple Storage Service (S3)
• Unlimited Storage.
• Pay for what you use:
• $0.20 per GByte of data transferred,
• $0.15 per GByte-Month for storage used,
• Second Life Update:
• 1TBytes, 40,000 downloads in 24 hours - $200,
45 Unit-8 Currrent Development
Advantages of Cloud Computing
• Lower computer costs: • You do not need a high-powered and high-priced computer to
run cloud computing's web-based applications.
• Since applications run in the cloud, not on the desktop PC, your desktop PC does not need the processing power or hard disk space demanded by traditional desktop software.
• When you are using web-based applications, your PC can be less expensive, with a smaller hard disk, less memory, more efficient processor...
• In fact, your PC in this scenario does not even need a CD or DVD drive, as no software programs have to be loaded and no document files need to be saved.
46 Unit-8 Currrent Development
Advantages of Cloud Computing
• Improved performance: • With few large programs hogging your computer's memory,
you will see better performance from your PC.
• Computers in a cloud computing system boot and run faster because they have fewer programs and processes loaded into memory…
• Reduced software costs: • Instead of purchasing expensive software applications, you
can get most of what you need for free-ish! • most cloud computing applications today, such as the Google Docs suite.
• better than paying for similar commercial software • which alone may be justification for switching to cloud applications.
47 Unit-8 Currrent Development
Advantages of Cloud Computing
• Instant software updates:
• Another advantage to cloud computing is that you are no longer faced
with choosing between obsolete software and high upgrade costs.
• When the application is web-based, updates happen automatically
• available the next time you log into the cloud.
• When you access a web-based application, you get the latest version
• without needing to pay for or download an upgrade.
• Improved document format compatibility.
• You do not have to worry about the documents you create on your
machine being compatible with other users' applications or OSes
• There are potentially no format incompatibilities when everyone is
sharing documents and applications in the cloud.
48 Unit-8 Currrent Development
Advantages of Cloud Computing • Unlimited storage capacity: • Cloud computing offers virtually limitless storage.
• Your computer's current 1 Tbyte hard drive is small compared to the hundreds of Pbytes available in the cloud.
• Increased data reliability: • Unlike desktop computing, in which if a hard disk crashes
and destroy all your valuable data, a computer crashing in the cloud should not affect the storage of your data. • if your personal computer crashes, all your data is still out there in the
cloud, still accessible
• In a world where few individual desktop PC users back up their data on a regular basis, cloud computing is a data-safe computing platform!
49 Unit-8 Currrent Development
Advantages of Cloud Computing
• Universal document access: • That is not a problem with cloud computing, because you do
not take your documents with you.
• Instead, they stay in the cloud, and you can access them whenever you have a computer and an Internet connection
• Documents are instantly available from wherever you are
• Latest version availability: • When you edit a document at home, that edited version is
what you see when you access the document at work.
• The cloud always hosts the latest version of your documents • as long as you are connected, you are not in danger of having an outdated
version
50 Unit-8 Currrent Development
Advantages of Cloud Computing
• Easier group collaboration:
• Sharing documents leads directly to better collaboration.
• Many users do this as it is an important advantages of cloud
computing
• multiple users can collaborate easily on documents and projects
• Device independence.
• You are no longer tethered to a single computer or network.
• Changes to computers, applications and documents follow you
through the cloud.
• Move to a portable device, and your applications and
documents are still available.
51 Unit-8 Currrent Development
Disadvantages of Cloud Computing
• Requires a constant Internet connection:
• Cloud computing is impossible if you cannot connect to the
Internet.
• Since you use the Internet to connect to both your
applications and documents, if you do not have an Internet
connection you cannot access anything, even your own
documents.
• A dead Internet connection means no work and in areas
where Internet connections are few or inherently unreliable,
this could be a deal-breaker.
52 Unit-8 Currrent Development
Disadvantages of Cloud Computing
• Does not work well with low-speed connections: • Similarly, a low-speed Internet connection, such as that
found with dial-up services, makes cloud computing painful at best and often impossible.
• Web-based applications require a lot of bandwidth to download, as do large documents.
• Features might be limited: • This situation is bound to change, but today many web-
based applications simply are not as full-featured as their desktop-based applications. • For example, you can do a lot more with Microsoft PowerPoint than
with Google Presentation's web-based offering
53 Unit-8 Currrent Development
Disadvantages of Cloud Computing
• Can be slow:
• Even with a fast connection, web-based applications can
sometimes be slower than accessing a similar software
program on your desktop PC.
• Everything about the program, from the interface to the
current document, has to be sent back and forth from your
computer to the computers in the cloud.
• If the cloud servers happen to be backed up at that moment,
or if the Internet is having a slow day, you would not get the
instantaneous access you might expect from desktop
applications.
54 Unit-8 Currrent Development
Disadvantages of Cloud Computing
• Stored data might not be secure:
• With cloud computing, all your data is stored on the cloud.
• The questions is How secure is the cloud?
• Can unauthorised users gain access to your confidential data?
• Stored data can be lost:
• Theoretically, data stored in the cloud is safe, replicated across
multiple machines.
• But on the off chance that your data goes missing, you have no
physical or local backup.
• Put simply, relying on the cloud puts you at risk if the cloud lets you
down.
55 Unit-8 Currrent Development
Data Warehousing Unit-8 Currrent Development
56
Unit-8 Currrent Development
57 57
Which are our lowest/highest margin customers ?
Who are my customers and what products are they buying?
Which customers are most likely to go to the competition ?
What impact will new products/services have on revenue and margins?
What product prom- -otions have the biggest impact on revenue?
What is the most effective distribution channel?
A producer wants to know….
Unit-8 Currrent Development
58
58
Data, Data everywhere
yet ... • I can’t find the data I need
• data is scattered over the network
• many versions, subtle differences
I can’t get the data I need
need an expert to get the data
I can’t understand the data I found
available data poorly documented
I can’t use the data I found
results are unexpected
data needs to be transformed from one form to other
59
What is a Data Warehouse?
A single, complete and
consistent store of data
obtained from a variety of
different sources made
available to end users in a
what they can understand and
use in a business context.
[Barry Devlin]
Unit-8 Currrent Development
60
What are the users saying...
• Data should be integrated across
the enterprise
• Summary data has a real value to
the organization
• Historical data holds the key to
understanding data over time
• What-if capabilities are required
Unit-8 Currrent Development
61
What is Data Warehousing?
A process of transforming
data into information and
making it available to users
in a timely enough manner to
make a difference
[Forrester Research, April 1996]
Data
Information
Unit-8 Currrent Development
62
Evolution
• 60’s: Batch reports • hard to find and analyze information
• inflexible and expensive, reprogram every new request
• 70’s: Terminal-based DSS and EIS (executive information systems) • still inflexible, not integrated with desktop tools
• 80’s: Desktop data access and analysis tools • query tools, spreadsheets, GUIs
• easier to use, but only access operational databases
• 90’s: Data warehousing with integrated OLAP engines and tools
Unit-8 Currrent Development
63
Warehouses are Very Large
Databases 35%
30%
25%
20%
15%
10%
5%
0% 5GB
5-9GB
10-19GB 50-99GB 250-499GB
20-49GB 100-249GB 500GB-1TB
Initial
Projected 2Q96
Source: META Group, Inc.
Resp
ondents
Unit-8 Currrent Development
64
Very Large Data Bases
• Terabytes -- 10^12 bytes:
• Petabytes -- 10^15 bytes:
• Exabytes -- 10^18 bytes:
• Zettabytes -- 10^21 bytes:
• Zottabytes -- 10^24 bytes:
Walmart -- 24 Terabytes
Geographic Information Systems
National Medical Records
Weather images
Intelligence Agency Videos
Unit-8 Currrent Development
65
Data Warehousing -
It is a process
• Technique for assembling and
managing data from various sources
for the purpose of answering business
questions. Thus making decisions that
were not previous possible
• A decision support database
maintained separately from the
organization’s operational database
Unit-8 Currrent Development
66
Data Warehouse
• A data warehouse is a
• subject-oriented
• integrated
• time-varying
• non-volatile
collection of data that is used primarily in
organizational decision making.
-- Bill Inmon, Building the Data Warehouse 1996
Unit-8 Currrent Development
69
Unit-8 Currrent Development
Data Mining Unit-8 Currrent Development
71
Data mining
• Process of semi-automatically analyzing large databases to
find patterns that are:
• valid: hold on new data with some certainty
• novel: non-obvious to the system
• useful: should be possible to act on the item
• understandable: humans should be able to interpret the pattern
• Also known as Knowledge Discovery in Databases (KDD)
72
Unit-8 Currrent Development
Why Data Mining • Credit ratings/targeted marketing:
• Given a database of 100,000 names, which persons are the least likely to default on their credit cards?
• Identify likely responders to sales promotions
• Fraud detection
• Which types of transactions are likely to be fraudulent, given the demographics and transactional history of a particular customer?
• Customer relationship management:
• Which of my customers are likely to be the most loyal, and which are most likely to leave for a competitor? :
Data Mining helps extract such information 73
Unit-8 Currrent Development
Applications • Banking: loan/credit card approval • predict good customers based on old customers
• Customer relationship management: • identify those who are likely to leave for a competitor.
• Targeted marketing: • identify likely responders to promotions
• Fraud detection: telecommunications, financial transactions • from an online stream of event identify fraudulent events
• Manufacturing and production: • automatically adjust knobs when process parameter changes
74
Unit-8 Currrent Development
Applications (continued)
• Medicine: disease outcome, effectiveness of treatments • analyze patient disease history: find relationship
between diseases
• Molecular/Pharmaceutical: identify new drugs
• Scientific data analysis: • identify new galaxies by searching for sub clusters
• Web site/store design and promotion: • find affinity of visitor to pages and modify layout
75
Unit-8 Currrent Development
The KDD process
• Problem fomulation
• Data collection
• subset data: sampling might hurt if highly skewed data
• feature selection: principal component analysis, heuristic search
• Pre-processing: cleaning
• name/address cleaning, different meanings (annual, yearly), duplicate removal, supplying missing values
• Transformation:
• map complex objects e.g. time series data to features e.g. frequency
• Choosing mining task and mining method:
• Result evaluation and Visualization:
Knowledge discovery is an iterative process
76
Unit-8 Currrent Development
Relationship with other fields
• Overlaps with machine learning, statistics, artificial
intelligence, databases, visualization but more stress
on
• scalability of number of features and instances
• stress on algorithms and architectures whereas foundations
of methods and formulations provided by statistics and
machine learning.
• automation for handling large, heterogeneous data
77
Unit-8 Currrent Development
Some basic operations
• Predictive:
• Regression
• Classification
• Collaborative Filtering
• Descriptive:
• Clustering / similarity matching
• Association rules and variants
• Deviation detection
78
Unit-8 Currrent Development
Data Mining in
Practice
79
Unit-8 Currrent Development
Application Areas
Industry Application
Finance Credit Card Analysis
Insurance Claims, Fraud Analysis
Telecommunication Call record analysis
Transport Logistics management
Consumer goods Promotion analysis
Data Service providers Value added data
Utilities Power usage analysis
80 Unit-8 Currrent Development
Why Now? • Data is being produced
• Data is being warehoused
• The computing power is available
• The computing power is affordable
• The competitive pressures are strong
• Commercial products are available
81
Unit-8 Currrent Development
Data Mining works with
Warehouse Data
• Data Warehousing provides the
Enterprise with a memory
ÑData Mining provides the Enterprise with intelligence
82 Unit-8 Currrent Development
Usage scenarios
• Data warehouse mining: • assimilate data from operational sources
• mine static data
• Mining log data
• Continuous mining: example in process control
• Stages in mining: • data selection pre-processing: cleaning
transformation mining result evaluation visualization
83
Unit-8 Currrent Development
Mining market
• Around 20 to 30 mining tool vendors • Major tool players: • Clementine, • IBM’s Intelligent Miner, • SGI’s MineSet, • SAS’s Enterprise Miner.
• All pretty much the same set of tools • Many embedded products: • fraud detection: • electronic commerce applications, • health care, • customer relationship management: Epiphany
84
Unit-8 Currrent Development
Green Computing Unit-8 Currrent Development
85
Green Computing
Unit-8 Currrent Development
86
The positive (or least negative) relationship between the
physical computer and its impact to the environments in
which it moves through from cradle to grave
Cradle to Grave Approach
• Full life-cycle analysis, not just the product on our desks
• Looks at:
• Manufacturing
• Use during lifetime
• Disposal and reallocation
Green Computing • Why • computer energy is often wasteful • leaving the computer on when not in use (CPU and fan consume power,
screen savers consume power)
• printing is often wasteful • how many of you print out your emails or meeting agendas
• printing out partial drafts
• for a “paperless” society, we tend to use more paper today than before computer-prevalence
• pollution • manufacturing techniques
• packaging
• disposal of computers and components
• toxicity • as we will see, there are toxic chemicals used in the manufacturing of
computers and components which can enter the food chain and water! 87
Unit-8 Currrent Development
Energy Use of PCs • CPU uses 120 Watts
• CRT uses 150 Watts • 8 hours of usage, 5 days a week = 562 KWatts • if the computer is left on all the time without proper power saver
modes, this can lead to 1,600 KWatts
• for a large institution, say a university of 40,000 students and faculty, the power bill for just computers can come to $2 million / year
• Energy use comes from • electrical current to run the CPU, motherboard, memory
• running the fan and spinning the disk(s)
• monitor (CRTs consume more power than any other computer component)
• printers
88
Unit-8 Currrent Development
Chemical Elements Found in Computers and Components
• Elements in bulk: lead, tin, copper, silicon, carbon, iron and aluminum
• Elements in small amounts: cadmium and mercury
• Elements in trace amounts: • germanium, gallium, barium, nickel, tantalum, indium, vanadium, terbium,
beryllium, gold, europium, titanium, ruthenium, cobalt, palladium, manganese, silver, antimony, bismuth, selenium, niobium, yttrium, rhodium, platinum, arsenic, lithium, boron, americium
• List of examples of devices containing these elements • almost all electronics contain lead & tin (as solder) and copper (as wire &
PCB tracks), though the use of lead-free solder is now spreading rapidly
• lead: solder, CRT monitors (Lead in glass), Lead-acid battery
89
Unit-8 Currrent Development
List Continued
• List of examples of devices containing these elements • tin: solder
• copper: copper wire, printed circuit board tracks
• aluminum: nearly all electronic goods using more than a few watts of power
• iron: steel chassis, cases & fixings
• silicon: glass, transistors, ICs, Printed circuit boards.
• nickel & cadmium: nickel-cadmium rechargeable batteries
• lithium: lithium-ion battery
• zinc: plating for steel parts
• gold: connector plating, primarily in computer equipment
• mercury: fluorescent tubes (numerous applications), tilt switches (pinball games, mechanical doorbells)
• sulphur: lead-acid battery
• carbon: steel, plastics, resistors
90
Unit-8 Currrent Development
Disposal • Reuse: donate your computer components to people who
may not have or have lesser quality computers • inner city schools, churches, libraries, third world countries • this however leads to the older computers being dumped but there is
probably no way around this as eventually the older computers would be discarded anyway
• Refurbish: rather than discarding your computer when the next generation is released, just get a new CPU and memory chips – upgrade rather than replace • while you will still be discarded some components, you will
retain most of the computer system (e.g., monitor, the system unit housing, cables)
• Are there adequate incentives to do either of the above? Do computer companies encourage refurbishing/upgrading?
91
Unit-8 Currrent Development
One More Solution: Recycling
• If companies can recycle the plastics and other
components, this can greatly reduce waste and toxins
• however, the hazardous materials in e-waste can harm the
recycle workers if they are not properly protected
• in undeveloped countries, a lot of the recycling chores are left up to
unprotected children!
• Developed countries now have facilities for recycling e-
waste
• however, in Europe, the plastics are discarded instead of
recycled because the flame retardant chemicals are too toxic to
work with
• To resolve these problems, the computer manufacturers
must start using recyclable chemicals 92
Unit-8 Currrent Development
How Do the Companies Rate? • 8: Nokia - regained its top position for eliminating the
worst chemicals from many products
• still needs to report on its recycling rate percentage
• 7.3: Dell - still among the top but loses points for not
having models free of the worst chemicals
• strong support for global take back
• 7.3: Lenovo - dropping down the rank for not having a
clear global take back program
• still missing out on products free of the worst chemicals on the
market
• 7: Sony Ericsson - among the top with clear timeline to
have products free of the worst chemicals by 2008
• need better chemicals take back reporting program
93
Unit-8 Currrent Development
Continued
• 6.7: Samsung - strong position for having a good chemical policy, but still lack products that are free from the worst chemicals • its take back system is not yet global and need improvement
• 6.7: Motorola - some products on the market are free from the worst chemicals but loses points for not providing clear timelines for eliminating these chemicals in all products • score points on reporting the recycling rate
• 6: Toshiba - good improvement particularly on waste and take back criteria • moved forward for providing some models without the worst
chemicals and for timelines for complete phase out
• 6: Fujitsu-Siemens - some models free of worst chemicals, but loses point for a weak take back and recycling program
94
Unit-8 Currrent Development
Continued • 5.7: Acer - standing still with improved chemical policies but no
models free of the worst chemicals • needs to improve on take back program
• 5.3: Apple - top mover with concrete timelines to eliminate the worst chemicals • loses points for not have a green product on the market and for a weak take
back program
• 5.3: HP - a free-faller, dropping down for failing to provide clear timelines for eliminating the worst chemicals • it looses points for weak definition of take back policies
• 5: Panasonic - moving up for making available products free of the worst chemicals • loses point for poor take back program
• 4: Sony - at the bottom of the rank for losing penalty point for inconsistent take back policies • some models without the worst chemicals
95
Unit-8 Currrent Development
Wireless Sensor Network Unit-8 Currrent Development
96
Introduction-WSN • Wireless Sensor Networks are networks that
consists of sensors which are distributed in an ad hoc manner.
• These sensors work with each other to sense some physical phenomenon and then the information gathered is processed to get relevant results.
• Wireless sensor networks consists of protocols and algorithms with self-organizing capabilities.
97
Unit-8 Currrent Development
Example of WSN
98
Unit-8 Currrent Development
Ref:http://esd.sci.univr.it/images/wsn-example.png
Comparison with ad hoc
networks
• Wireless sensor networks mainly use broadcast communication
while ad hoc networks use point-to-point communication.
• Unlike ad hoc networks wireless sensor networks are limited by
sensors limited power, energy and computational capability.
• Sensor nodes may not have global ID because of the large
amount of overhead and large number of sensors.
99
Unit-8 Currrent Development
Applications of Wireless Sensor
networks
The applications can be divided in three categories:
1. Monitoring of objects.
2. Monitoring of an area.
3. Monitoring of both area and objects.
* Classification due to Culler, Estrin, Srivastava
100
Unit-8 Currrent Development
Monitoring Area
• Environmental and Habitat Monitoring
• Precision Agriculture
• Indoor Climate Control
• Military Surveillance
• Treaty Verification
• Intelligent Alarms
101
Unit-8 Currrent Development
Example: Precision
Agriculture
• Precision agriculture aims at making cultural operations more efficient, while reducing
environmental impact. • The information collected from
sensors is used to evaluate optimum sowing density, estimate fertilizers and other inputs needs, and to more accurately predict crop yields.
102 Unit-8 Currrent Development
Monitoring Objects
• Structural Monitoring
• Eco-physiology
• Condition-based Maintenance
• Medical Diagnostics
• Urban terrain mapping
103
Unit-8 Currrent Development
Example: Condition-based
Maintenance
• Intel fabrication plants
• Sensors collect vibration data, monitor wear and
tear; report data in real-time
• Reduces need for a team of engineers; cutting costs
by several orders of magnitude
104
Unit-8 Currrent Development
Monitoring Interactions between Objects and
Space
• Wildlife Habitats
• Disaster Management
• Emergency Response
• Ubiquitous Computing
• Asset Tracking
• Health Care
• Manufacturing Process Flows
105
Unit-8 Currrent Development
Example: Habitat
Monitoring
• The ZebraNet Project
Collar-mounted sensors monitor zebra movement in
Kenya
Source: Margaret Martonosi, Princeton University 106
Unit-8 Currrent Development
Characteristics of Wireless
Sensor Networks • Wireless Sensor Networks mainly consists of sensors.
Sensors are -
• low power
• limited memory
• energy constrained due to their small size.
• Wireless networks can also be deployed in extreme environmental conditions and may be prone to enemy attacks.
• Although deployed in an ad hoc manner they need to be self organized and self healing and can face constant reconfiguration.
107
Unit-8 Currrent Development
Design Challenges
• Heterogeneity
• The devices deployed maybe of various types and need to
collaborate with each other.
• Distributed Processing
• The algorithms need to be centralized as the processing is
carried out on different nodes.
• Low Bandwidth Communication
• The data should be transferred efficiently between sensors
108
Unit-8 Currrent Development
Continued..
• Large Scale Coordination
• The sensors need to coordinate with each other to produce required results.
• Utilization of Sensors
• The sensors should be utilized in a ways that produce the maximum performance and use less energy.
• Real Time Computation
• The computation should be done quickly as new data is always being generated.
109
Unit-8 Currrent Development
Operational Challenges of Wireless
Sensor Networks
• Energy Efficiency
• Limited storage and computation
• Low bandwidth and high error rates
• Errors are common • Wireless communication
• Noisy measurements
• Node failure are expected
• Scalability to a large number of sensor nodes
• Survivability in harsh environments
• Experiments are time- and space-intensive
110
Unit-8 Currrent Development
Enabling Technologies
Embedded Networked
Sensing
Control system w/
Small form factor
Untethered nodes
Exploit
collaborative
Sensing, action
Tightly coupled to physical world
Embed numerous distributed
devices to monitor and interact
with physical world
Network devices to coordinate and
perform higher-level tasks
Exploit spatially and temporally dense, in situ, sensing and actuation 111
Unit-8 Currrent Development
Future of WSN Smart Home / Smart Office
• Sensors controlling
appliances and electrical
devices in the house.
• Better lighting and
heating in office
buildings.
• The Pentagon building
has used sensors
extensively. 112
Unit-8 Currrent Development
Biomedical / Medical
• Health Monitors
• Glucose
• Heart rate
• Cancer detection
• Chronic Diseases
• Artificial retina
• Cochlear implants
• Hospital Sensors
• Monitor vital signs
• Record anomalies 113
Unit-8 Currrent Development
Industrial & Commercial
• Numerous industrial and commercial applications:
• Agricultural Crop Conditions
• Inventory Tracking
• In-Process Parts Tracking
• Automated Problem Reporting
• RFID – Theft Deterrent and Customer Tracing
• Plant Equipment Maintenance Monitoring
114
Unit-8 Currrent Development
Traffic Management & Monitoring
Future cars could use wireless sensors to:
Handle Accidents
Handle Thefts
Sensors embedded in the roads to:
–Monitor traffic flows
–Provide real-time route updates
115
Unit-8 Currrent Development
Unit-8 Currrent Development
116