Uncertainty in Expert Systems CPS 4801
Uncertainty • Uncertainty is the lack of exact knowledge that would enable us to reach a fully reliable solution. • Classical logic assumes perfect knowledge exists: IF A is true THEN B is true • Describing uncertainty: • If A is true, then B is true with probability P
Sources of uncertainty • Weak implications: we want to capture associations and correlations, not just strict cause and effect. • Imprecise language: • How often is "sometimes"? • Can we quantify "often," "sometimes," and "always"? • Unknown data: in real problems, data is often incomplete or missing. • Differing experts: experts often disagree, or agree for different reasons. • One solution: attach a weight to each expert's opinion.
Two approaches • Bayesian reasoning • Bayesian rule (Bayes’ rule) by Thomas Bayes • Bayesian network (Bayes network) • Certainty factors
Probability Theory • The probability of an event is the proportion of cases in which the event occurs. • Probability ranges numerically from zero (impossibility) to unity (absolute certainty), i.e. from 0 to 1. • P(success) + P(failure) = 1
Example • Flip a fair coin: P(head) = ½, P(tail) = ? • Flip a biased coin: P(head) = ¼, P(tail) = ? • Roll a die: • P(getting a 6) = ? • P(not getting a 6) = ? • In general: P(A) = p, P(¬A) = 1 − p
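For reference, the answers follow directly from P(¬A) = 1 − P(A): the fair coin gives P(tail) = ½, the biased coin gives P(tail) = 1 − ¼ = ¾, and a fair die gives P(getting a 6) = 1/6 and P(not getting a 6) = 5/6.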
Example • P(head) = ½; P(head head head) = ? • Xi = result of the i-th coin flip, Xi ∈ {head, tail} • P(X1 = X2 = X3 = X4) = ? • So far, the flips are independent of one another, and the two outcomes of a single flip are mutually exclusive. • For independent events: P(X,Y) = P(X)P(Y), where P(X,Y) is the joint probability.
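Worked answers: independence lets the probabilities multiply, so P(head head head) = ½ × ½ × ½ = ⅛. For four flips, P(X1 = X2 = X3 = X4) = P(all heads) + P(all tails) = (½)⁴ + (½)⁴ = ⅛.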
Example • P( {X1 X2 X3 X4} contains >= 3 head ) = ?
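Worked answer: of the 2⁴ = 16 equally likely outcomes, four contain exactly three heads and one contains four heads, so P(at least 3 heads) = 5/16.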
Conditional Probability • Suppose events A and B are not mutually exclusive, but occur conditionally on the occurrence of the other. • The probability that event A will occur if event B occurs is called the conditional probability of A given B:
p(A|B) = p(A∩B) / p(B)
Conditional Probability • The probability that both A and B occur is called the joint probability of A and B, written p(A∩B):
p(A∩B) = p(A|B) × p(B)
Conditional Probability • Similarly, the conditional probability that event B will occur if event A occurs can be written as:
p(B|A) = p(A∩B) / p(A), so p(A∩B) = p(B|A) × p(A)
The Bayesian Rule • The Bayesian rule (named after Thomas Bayes, an 18th-century British mathematician) follows from equating the two expressions for p(A∩B):
p(A|B) × p(B) = p(B|A) × p(A)
p(A|B) = p(B|A) × p(A) / p(B)
Applying Bayes’ rule • A = disease, B = symptom • P(disease|symptom) = P(symptom|disease) * P(disease) / P(symptom)
Applying Bayes’ rule • A doctor knows that the disease meningitis causes the patient to have a stiff neck 70% of the time. • The probability that a patient has meningitis is 1/50,000. • The probability that any patient has a stiff neck is 1%. • P(s|m) = 0.7 • P(m) = 1/50000 • P(s) = 0.01
Applying Bayes’ rule • P(s|m) = 0.7 • P(m) = 1/50000 • P(s) = 0.01 • P(m|s) = P(s|m) * P(m) / P(s) • = 0.7 * 1/50000 / 0.01 • = 0.0014 • = around 1/714 • Conclusion: Less than 1 in 700 patients with a stiff neck have meningitis.
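A quick numerical check of this arithmetic as a minimal Python sketch (variable names are illustrative):

# Bayes' rule: P(meningitis | stiff neck)
p_s_given_m = 0.7    # P(stiff neck | meningitis)
p_m = 1 / 50000      # prior P(meningitis)
p_s = 0.01           # P(stiff neck) among all patients

p_m_given_s = p_s_given_m * p_m / p_s
print(p_m_given_s)   # 0.0014, roughly 1 in 714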
Example: Coin Flip • P(X1 = H) = ½ • If X1 is H: P(X2 = H | X1 = H) = 0.9 • If X1 is T: P(X2 = T | X1 = T) = 0.8 • P(X2 = H) = ?
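Worked answer, using the rule of total probability formalized on the next slides: P(X2 = H) = P(X2 = H | X1 = H) P(X1 = H) + P(X2 = H | X1 = T) P(X1 = T) = 0.9 × 0.5 + (1 − 0.8) × 0.5 = 0.55.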
What did we learn from the example? • If an event depends on exactly two mutually exclusive events (here X1 = H and X1 = T), its probability combines the two conditional cases, as formalized on the next slide. • Within the same condition, complements behave as usual: P(¬X|Y) = 1 − P(X|Y) • P(X|¬Y) = 1 − P(X|Y)? No: in the coin example, P(X2 = H | X1 = T) = 0.2, while 1 − P(X2 = H | X1 = H) = 0.1.
Conditional probability • If event A depends on exactly two mutually exclusive events, B and ¬B, we obtain:
p(A) = p(A|B) × p(B) + p(A|¬B) × p(¬B)
• Similarly, if event B depends on exactly two mutually exclusive events, A and ¬A, we obtain:
p(B) = p(B|A) × p(A) + p(B|¬A) × p(¬A)
The Bayesian Rule • Substituting this expression for p(B) into the Bayesian rule yields:
p(A|B) = p(B|A) × p(A) / [p(B|A) × p(A) + p(B|¬A) × p(¬A)]
Bayesian reasoning • Instead of A and B, consider H (a hypothesis) and E (evidence for that hypothesis). • Expert systems use the Bayesian rule to rank potentially true hypotheses based on the observed evidence.
Bayesian reasoning • If event E occurs, then the probability that event H will occur is p(H|E): IF E (evidence) is true THEN H (hypothesis) is true with probability p
Bayesian reasoning Example: Cancer and Test • P(C) = 0.01 P(¬C) = 0.99 • P(+|C) = 0.9 P(-|C) = 0.1 • P(+|¬C) = 0.2 P(-|¬C) = 0.8 • P(C|+) = ?
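A worked solution as a small Python sketch (variable names are mine), using the expanded Bayesian rule from the earlier slide; note that even after a positive test, cancer remains unlikely:

p_c = 0.01               # prior P(C)
p_pos_given_c = 0.9      # P(+|C), test sensitivity
p_pos_given_not_c = 0.2  # P(+|¬C), false positive rate

# Total probability: P(+) = P(+|C)P(C) + P(+|¬C)P(¬C)
p_pos = p_pos_given_c * p_c + p_pos_given_not_c * (1 - p_c)

p_c_given_pos = p_pos_given_c * p_c / p_pos
print(p_c_given_pos)     # ≈ 0.043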
Bayesian reasoning • Expert identifies prior probabilities forhypotheses p(H) and p(¬H) • Expert identifies conditional probabilities for: • p(E|H): Observing evidence E if hypothesis H is true • p(E|¬H): Observing evidence E if hypothesis H is false
Bayesian reasoning • Experts provide p(H), p(¬H), p(E|H), and p(E|¬H) • Users describe observed evidence E • Expert system calculates p(H|E) using Bayesian rule • p(H|E) is the posterior probability that hypothesis H occurs upon observing evidence E • What about multiple hypotheses and evidences?
Bayesian reasoning with multiple hypotheses • Expand the Bayesian rule to work with multiple mutually exclusive hypotheses (H1...Hm) and a single evidence E:
p(Hi|E) = p(E|Hi) × p(Hi) / Σ k=1..m [p(E|Hk) × p(Hk)]
Bayesian reasoning with multiple hypotheses and evidences • Expand the Bayesian rule to work with multiple hypotheses (H1...Hm) and evidences (E1...En):
p(Hi|E1 E2 ... En) = p(E1 E2 ... En|Hi) × p(Hi) / Σ k=1..m [p(E1 E2 ... En|Hk) × p(Hk)]
Bayesian reasoning with multiple hypotheses and evidences • Assuming conditional independence among evidences E1...En, the joint likelihoods factor into single-evidence terms:
p(Hi|E1 E2 ... En) = p(E1|Hi) × p(E2|Hi) × ... × p(En|Hi) × p(Hi) / Σ k=1..m [p(E1|Hk) × p(E2|Hk) × ... × p(En|Hk) × p(Hk)]
Bayesian reasoning Example • Expert is given three conditionally independent evidences E1, E2, and E3 • Expert creates three mutually exclusive and exhaustive hypotheses H1, H2, and H3 • Expert provides prior probabilities p(H1), p(H2), p(H3) • Expert identifies conditional probabilities for observing each evidence Ei for all possible hypotheses Hk
Bayesian reasoning Example • Expert data (prior and conditional probabilities):

             H1     H2     H3
p(Hk)        0.40   0.35   0.25
p(E1|Hk)     0.3    0.8    0.5
p(E2|Hk)     0.9    0.0    0.7
p(E3|Hk)     0.6    0.7    0.9
Bayesian reasoning Example • User observes E3; the expert system computes the posterior probabilities: • p(H1|E3) ≈ 0.34, p(H2|E3) ≈ 0.34, p(H3|E3) ≈ 0.32
Bayesian reasoning Example • User then observes E1; the expert system updates the posteriors: • p(H1|E1E3) ≈ 0.19, p(H2|E1E3) ≈ 0.52, p(H3|E1E3) ≈ 0.30
Bayesian reasoning Example • Finally, user observes E2; the expert system updates the posteriors: • p(H1|E1E2E3) ≈ 0.45, p(H2|E1E2E3) = 0.0, p(H3|E1E2E3) ≈ 0.55 • Because p(E2|H2) = 0, observing E2 rules out H2 entirely.
Bayesian reasoning Example • Initial expert-based ranking: • p(H1) = 0.40; p(H2) = 0.35; p(H3) = 0.25 • Expert system ranking after observing E1, E2, E3: • p(H1) = 0.45; p(H2) = 0.0; p(H3) = 0.55
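The following is a compact Python sketch that reproduces this ranking from the expert data above, assuming conditional independence of E1, E2, and E3 (the function name and code structure are mine, not part of the original material):

# Priors for three mutually exclusive, exhaustive hypotheses H1..H3.
priors = [0.40, 0.35, 0.25]

# cond[e][k] = p(E_{e+1} | H_{k+1}); rows correspond to E1, E2, E3.
cond = [
    [0.3, 0.8, 0.5],  # p(E1|Hk)
    [0.9, 0.0, 0.7],  # p(E2|Hk)
    [0.6, 0.7, 0.9],  # p(E3|Hk)
]

def posteriors(priors, cond, observed):
    """Rank hypotheses after observing the given evidences (0-based indices)."""
    # Numerator of the Bayesian rule: p(Hk) times the product of p(Ei|Hk).
    scores = list(priors)
    for e in observed:
        scores = [s * cond[e][k] for k, s in enumerate(scores)]
    # Denominator: sum over all hypotheses, which normalizes the scores.
    total = sum(scores)
    return [s / total for s in scores]

print(posteriors(priors, cond, [2]))        # after E3:         ~[0.34, 0.34, 0.32]
print(posteriors(priors, cond, [2, 0]))     # after E3, E1:     ~[0.19, 0.52, 0.30]
print(posteriors(priors, cond, [2, 0, 1]))  # after E3, E1, E2: ~[0.45, 0.0, 0.55]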
Problems with the Bayesian approach • Humans are not very good at estimating probabilities! • In particular, we tend to make different assumptions when estimating prior and conditional probabilities. • Reliable and complete statistical information is often not available. • The Bayesian approach requires evidences to be conditionally independent, which is often not the case. • One solution: certainty factors