The value of Knowledge Representation and the Decentralized Knowledge Base for Artificial Intelligence (expert systems)

in #ai7 years ago (edited)


Introduction 

This article contains an explanation of two core concepts for creating decentralized artificial intelligence and also discusses some projects which are attempting to bring these concepts into practical reality. The first of these concepts is called knowledge representation. The second of these concepts is called a knowledge base. Human beings contribute to a knowledge base using a knowledge representation language. Reasoning over this knowledge base is possible and artificial intelligence utilizing this knowledge base is also possible.


Knowledge representation defined by it's roles


To define knowledge representation we must list the five roles of knowledge representation which can reveal what it does.


1. Knowledge representation is a surrogate

2. Knowledge representation is a set of ontological commitments

3. Knowledge representation is a fragmentary theory of intelligent reasoning

4. Knowledge representation is a medium for efficient computation

Part 1: Knowledge Representation is a Surrogate

By surrogate we means it is substituting or acting in place of something. So if knowledge representation is a surrogate then it must be representing some original. There is of course an issue that the surrogate must be a completely accurate representation but if we want a completely accurate representation of an object then it can only come from the object itself. In this case all other representations are inaccurate as they inevitably contain simplifying assumptions and possibly artifacts. To put this into a context, if you make a copy of an audio recording, for every copy you make it going to contain slightly more artifacts. This similarly also happens when dealing with information sent through a wire, where if not properly amplified there eventually will be artifects that come from copying a transmission. 

"Two important consequences follow from the inevitability of imperfect surrogates. One consequence is that in describing the natural world, we must inevitably lie, by omission at least. At a minimum we must omit some of the effectively limitless complexity of the natural world; our descriptions may in addition introduce artifacts not present in the world.

The second and more important consequence is that all sufficiently broad-based reasoning about the natural world must eventually reach conclusions that are incorrect, independent of the reasoning process used and independent of the representation employed. Sound reasoning cannot save us: If the world model is somehow wrong (and it must be) some conclusions will be incorrect, no matter how carefully drawn. A better representation cannot save us: all representations are imperfect and any imperfection can be a source of error."

Part 2: Knowledge Representation is a Set of Ontological Commitments


"If, as we have argued, all representations are imperfect approximations to reality, each approximation attending to some things and ignoring others, then in selecting any representation we are in the very same act unavoidably making a set of decisions about how and what to see in the world. That is, selecting a representation means making a set of ontological commitments. (2) The commitments are in effect a strong pair of glasses that determine what we can see, bringing some part of the world into sharp focus, at the expense of blurring other parts."


In this case because our commitments are made then our representation is selected by making a set of ontological commitments. An ontological commitment is a framework for how we will view the world, such as viewing the world through logic. If we choose to view the world through logic, through rule-based systems then all of our knowledge about the world is also within that framework. We choose our representation technology and commit to a particular view of the world.

Part 3: Knowledge Representation is a Fragmentary Theory of Intelligent Reasoning


Mathmaetical logic seems to provide a basis for some of intelligent reasoning but  it is also recognized to be derived from the five fields which include of course mathematical logic, but also psychology, biology, statistics, and economics. If we go with mathematical logic then we have deductive and inductive reasoning approaches. Deductive reasoning according to some is the basis behind. If we want to explore an example of reasoning we can take the Socrates example,

Statement A: True? Y/N?
"All men are mortal"

Statement B: True? Y/N?
"Socrates is a man"

Satement C: True? Y/N?

"Socrates is a mortal"

If A is true, and B is also true, then C must be true.  This is an example of basic logical reasoning which can easily be resolved using symbol manipulation and knowledge representation. The symbol at play in this example would be implication.


Part 4: Knowledge Representation is a Medium for Efficient Computation

If we think of computational efficiency, and think of all forms of computation whether mechanical or natural in the sense of the sort of computation done by a biological entity, then we may think of knowledge representation as a medium for that computation efficiency.  Currently we think of money as a medium of exchange, and if we think of the human brain as a type of computer which does human computation, then we may think of knowledge representation.


While the issue of efficient use of representations has been addressed by representation designers, in the larger sense the field appears to have been historically ambivalent in its reaction.  Early recognition of the notion of heuristic adequacy [16] demonstrates that early on  researchers appreciated the significance of the computational properties of a representation,  but the tone of much subsequent work in logic (e.g., [13]) suggested that epistemology (knowledge content) alone mattered, and defined computational efficiency out of the agenda.  Epistemology does of course matter, and it may be useful to study it without the potentially distracting concerns about speed.  But eventually we must compute with our representations, hence efficiency must be part of the agenda.  The pendulum later swung sharply over, to what we might call the computational imperative view.  Some work in this vein (e.g., [15]) offered representation languages whose design was strongly driven by the desire to provide not only efficiency, but guaranteed efficiency.  The result appears to be a language of significant speed but restricted expressive power [6]


While I will admit the above paragraph may be a bit cryptic, shows that there is a view that better representation of knowledge leads to computational efficiency. 

Part 5: Knowledge Representation is a Medium of Human Expression

Of course knowledge representation is part of how we communicate with each other or with machines. Human beings use natural language to convey knowledge and this natural language can include the use of vocabularies of words with agreed upon meanings. This vocabulary of words may be found in various dictionaries including the urban dictionary and we rely on these dictionaries as a sort of knowledge base. 

What is a decentralized Knowledge Base?

To understand what a decentralized knowledge base is we must first describe what a knowledge base is. A knowledge base stores knowledge representations which are described in the above examples. This knowledge base in more simple terms could be thought of as representing the facts about the world in the form of structured and or unstructured information which can be utilized by a computer system. An artificial intelligence can utilize a knowledge base to solve problems and typically this particular kind of artificial intelligence is called an expert system. The artificial intelligence in the most simple form will just reason on this knowledge base through an inference engine and through this it can do the sort of computations which are of great utility to problem solvers.

When we think of Wikipedia we are thinking about an encyclopedia which the whole world can contribute to. When we think about the problems with Wikipedia we can quickly see that one of the problems is the fact that it's centralized. We also have the problem that the knowledge that is stored on Wikipedia is not stored in a way which machines can make use of it and this means even if Wikipedia can be useful for humans to look up facts it is not in the current form able to act effectively as a decentralized knowledge base. DBPedia is an attempt to bring Wikipedia into a form which machines can make use of but it still is centralized which means a DDOS or similar attack can censor it.

Decentralized knowledge is important for the world and a decentralized knowledge base is critical for the development of a decentralized AI. If we are speaking about an expert system then the knowledge base would have to be as large as possible which means we may need to give the incentive for human beings to contribute and share their knowledge with this decentralized knowledge base. We also would have to provide a knowledge representation language so that human beings can share their knowledge in the appropriate way for it to enter into the knowledge base to be used by potential AI.


Conclusion

Knowledge representation is a necessary component for the vast majority of attempts at a truly decentralized AI. If we are going to deal with any AI then we must have a way for human beings to convey knowledge to the machines in a way which both the human beings and machines can understand it. The use of a knowledge representation language makes it possible for a human being to contribute to a knowledge base and this ultimately allows for machines to make use of it's inference engine capabilities to reason from this knowledge base. In the case of a decentralized knowledge base then the barrier of entry is low or non-existent and any human being or perhaps any living being or even robots can contribute to this shared resource yet at the same time both humans and machines can gain utility from this shared resource.  An artificial intelligence which functions similar to an expert system can make use of an extremely large knowledge base to solve complex problems and a decentralized knowledge base combined with open and decentralized access to this artificial intelligence can benefit humanity and life on earth in general if used appropriately.

Discussion of example projects


Tauchain


One of the well known attempts to do something like this is Tauchain which will have both a knowledge representation system and a decentralized knowledge base. In the case of Tau there will be a special simple knowledge representation language under development which resembles simplified controlled English. This knowledge representation language will allow anyone to contribute to the collective knowledge base. Tauchain eventually will have a decentralized knowledge base over the course of it's evolution from the first alpha.


Lunyr


Unfortunately upon reading the Lunyr whitepaper and following their public materials I fail to see how they will pull off what they are promising. I do not think the current Ethereum can handle concurrency which probably would be necessary for doing AI. I also don't see how Ethereum would be able to do it securely with the current design although I remain optimistic about Casper. The lack of code on Github, the lack of references to their research, does not allow me to completely analyze their approach. I can see based on the fact that they are talking about a decentralized knowledge base that their approach will require more than the magic of the market combined with pretty marketing. They will require a knowledge representation language, they will require a true decentralized knowledge base built into IPFS. This true decentralized knowledge base will have to scale with IPFS and through this maybe they can achieve something but without a clear plan of action I would have to say that today I'm not confident in their approach or in Ethereum's ability to handle doing it efficiently.



References

1. http://groups.csail.mit.edu/medg/ftp/psz/k-rep.html

2. https://en.wikipedia.org/wiki/Inference_engine

3. https://en.wikipedia.org/wiki/Knowledge_base

4.  https://en.wikipedia.org/wiki/Syllogism
5. https://en.wikipedia.org/wiki/Knowledge-based_systems
6. https://en.wikipedia.org/wiki/Expert_system

Sort:  

Hmm, as a person with no programming experience and who was just watching a video of a kitten riding a turtle, this piece was a little demanding on my pea sized brain

In addition to greater efficiency in building a knowledge base, I believe one of the greatest advantages of the decentralization of AI is its ability to reduce the probability of minority bad actors successfully harnessing the technology for large scale nefarious ends.

Also, I did not follow your reasoning on Part 4. The paper you cited concluded that at times there is a trade off between expressiveness and computational efficiency, yet you interpreted this as 'better representational knowledge leads to computational efficiency.' Surely in certain cases, for example telling a joke, highly efficient methods of representation such as syllogistic logic is not the best mode of delivery? If chicken crosses road Then chicken wanted to get to the other side.

Very interesting read as usual

Yes that particular quote is a bit cryptic and not easy for me to explain. You always have trade offs and yes expressiveness can cost you efficiency.

This quote is perhaps much better:

One contribution to this pragmatic efficiency is supplied by the guidance a representation provides for organizing information so as to facilitate making the recommended inferences.

Which I interpreted (and this interpretation could be wrong) that KR contributes to pragmatic efficiency through how it's organized such that it can facilitate RECOMMENDED INFERENCES.

And then we have this quote below:

While sanctioned inferences tell us what conclusions we are permitted to make, that set is invariably very large and hence provides insufficient constraint. Any automated system attempting to reason, guided only by knowing what inferences are sanctioned, will soon find itself overwhelmed by choices. Hence we need more than an indication of which inferences we can legally make, we also need some indication of which inferences which inferences are appropriate to make, i.e., intelligent. That indication is supplied by the set of recommended inferences.

So in other words I think by efficiency they are talking about reducing or narrowing down the inferences from that which is sanctioned (available to be chosen) to that which is intelligent and or appropriate to choose. I guess I would think of it as common sense?

This last quote is also vital:

Note that the need for a specification of recommended inferences means that in specifying a representation we also need to say something about how to reason intelligently. Representation and reasoning are inextricably, and usefully, intertwined: a knowledge representation is a theory of intelligent reasoning.

Humans are good at common sense and can do humor. Machines have a very difficult time with that because by default a machine would not narrow the search space or know certain stereotypes or common human inferences for certain situations. It's the difference between sanctioned and recommended, between reason and common sense.

References

  1. http://groups.csail.mit.edu/medg/ftp/psz/k-rep.html

😍😵😂

Finally a bitta craic ;)

Very excited for what you bring to steemit ❤

Cheers @dana-edwards! Very clever. Keep posting!

Coin Marketplace

STEEM 0.20
TRX 0.14
JST 0.029
BTC 66599.39
ETH 3336.69
USDT 1.00
SBD 2.70