A Survey of the Hysteretic Duhem Model
- Cite this article as:
- Ikhouane, F. Arch Computat Methods Eng (2017). doi:10.1007/s11831-017-9218-3
- 6 Downloads
Abstract
The Duhem model is a simulacrum of a complex and hazy reality: hysteresis. Introduced by Pierre Duhem to provide a mathematical representation of thermodynamical irreversibility, it is used to describe hysteresis in other areas of science and engineering. Our aim is to survey the relationship between the Duhem model as a mathematical representation, and hysteresis as the object of that representation.
1 Prolegomenon
Citing a reference allows the author of a scientific article to attribute work and ideas to the correct source. Nonetheless, the process of describing that work and these ideas assumes some interpretation, at least of their relative importance. In order to ensure that the interpretation is reliable, we use, whenever adequate, a quotation from the reference so that the reader has a direct access to the cited source. This direct access is even more important when the source is not easily available like Ref. [67]1 or is not written in English like Refs. [16, 17, 18, 19, 20, 21, 22] among others, in which case we provide a translation. This is our approach to the literature review of Sect. 2.
In Sects. 4–9 we proceed differently since our aim in these sections is to provide an accurate description of the results presented in the references under study. Because of the diversity of notations and nomenclature in these references, quotations may not be the best way to transmit that accurate description. Instead, we summarize the references using a unifying framework provided in Ref. [35]. The references we have chosen in Sects. 4–9 are, in our opinion, those that are relevant to the subject of this study.
Our aim in this work is also to shed light on the relationships between the concepts introduced in this paper. To this end, we use a special form of the Duhem model, the scalar semilinear one, as a case study.
2 Introduction and Literature Review
A brief history of the Duhem model The term hysteresis was coined by J. A. Ewing in 1881 to describe a specific relationship between the torsion of a magnetized wire and its polarization (although the phenomenon of hysteresis has been known and described by several authors before that date as shown in the literature review provided in Ref. [65]).
Quoting from Ewing’s paper [28]: “These curves exhibit, in a striking manner, a persistence of previous state, such as might be caused by molecular friction. The curves for the back and forth twists are irreversible, and include a wide area between them. The change of polarization lags behind the change of torsion. To this action …
However, the first in-depth study of hysteresis is due to Pierre Duhem2 in the period 1896–1902. A detailed review of Duhem’s work on hysteresis may be found in [67, Chapter IV] so that we provide here only those elements of that extensive work that are directly related to the present paper.
To understand the motivation for Duhem’s work we quote from [67, p. 306]: “take a metallic wire under strain by means of a load. We can take the length of the wire and its temperature as variables that define its state. The gravity weight P will represent the external action. At temperature T and under the load P the wire may be at equilibrium with length l. Give P infinitely small variations, the length l and temperature T will also experience infinitely small variations, and a new equilibrium may be achieved. In this last state, give the gravity weight and temperature variations equal in absolute value, but of opposite signs to the previous ones. The length l should experience a variation equal to the previous one with opposite sign. However, experimentation shows that this is not the case. In general, to the expansion of the wire corresponds a smaller contraction, and the difference lasts with time.”
This permanent deformation is the subject of a seven-memoirs research by Duhem, see Refs. [16]–[22]. In his first memoir submitted to the section of sciences of the Académie de Belgique on October 13, 1894, and reviewed by the mathematician Charles Lagrange in Ref. [44],3 Duhem writes: “The attempts to make the different kinds of permanent deformations compatible with the principles of thermodynamics have been few up till now. Only one of these attempts, due to M. Marcel Brillouin, appears to us worthy of interest.” [16, p. 3]. Duhem analyzes the work of Brillouin and concludes that it is not compatible with the principles of thermodynamics [16, p. 6] (see also [19, pp. 5–7]).
To the best of our knowledge, the first reference that called the form (5) “Duhem model” is Ref. [48] in 1993.4 Indeed, the authors of Ref. [43] attributed erroneously Duhem’s model to Madelung [63, p. 797].5
Between 1916 when P. Duhem dies and 1993 when his model of hysteresis is finally attributed to him, Duhem’s work on hysteresis does not have a relevant impact. Major references on hysteresis like Refs. [8, 12] or [56] do not cite his memoirs. Several authors propose different forms of the Duhem model without a direct reference to Duhem’s memoirs. This is the case of the Coleman and Hodgdon model of magnetic hysteresis [12], the Dahl model of friction [13], the model (5) in Ref. [3], and a generalized form of the model (5) in Ref. [43, p. 95]. In 1952, Everett cites briefly Duhem’s work as follows [24, p. 751]: “From a thermodynamic standpoint the introduction of an additional variable whose value depends on the history of the system is sufficient for a formal discussion (cf. Duhem [ref]
A general theory of physics based on a molecular interpretation was precisely what Duhem rejected. In a review of his work presented in 1913 for his application to the Académie des Sciences, Duhem writes that his “doctrine should note imitate the numerous mechanical theories proposed by physicists hitherto; to the observable properties that apparatus measure, it will not substitute hidden movements of hypothetical bodies” [67, p. 74].
In recent times, Duhem’s phenomenological approach is becoming more accepted [5, 9, 46, 52, 57]. Indeed, “hysteretic phenomena arising in structural and mechanical systems are so complicated that there has been no well-accepted mathematical model which can describe all observed hysteretic characteristics.” [52, p. 1408]. Moreover, the Preisach model which was believed to describe the constitutive behavior of magnetic hysteresis, has shown to be a phenomenological model [49, p. 2].
Several reasons are invoked for the use of Duhem’s model to describe hysteresis. On the one hand, “differential equation-based models lead to a particularly simple phenomenological description” [46, p. C8–545]. On the other hand, the “Duhem models [sic] …
There are several generalizations of the original Duhem model (5). The following generalization is proposed in [43, p. 95]: ˙x(t)=f(t,x(t),u(t),˙u(t))
Why are there different generalized forms of the Duhem model ? To answer this question we have to recall the concept of rate independence.6
Graph “Force–displacement” with hysteresis
…
Rate independence is used by Visintin to define hysteresis :“Definition. Hysteresis = Rate Independent Memory Effect.”[64, p. 13]. However, “this definition excludes any viscous-type memory” [64, p. 13] because it leads to rate-dependent effects that increase with velocity. A definition based on rate independence assumes that “the presence of hysteresis loops is not … an essential feature of hysteresis.” [64, p. 14].
This point of view is challenged by Oh and Bernstein who consider hysteresis as a “nontrivial quasi-dc input-output closed curve” [54, p. 631] and propose a modified version of the Duhem model which can represent rate-dependent or rate-independent effects. A characterization of hysteresis systems using hysteresis loops is also addressed by Ikhouane in Ref. [35] through the concepts of consistency and strong consistency.
In light of what has been said it becomes clear that, in Ref. [64], the generalizations of Duhem’s model are done in such a way that rate independence is preserved, whilst a definition of hysteresis based on hysteresis loops in Ref. [54] is compatible with a generalized form of the Duhem model that may be rate dependent or rate independent.
Why are there different models of hysteresis? In Ref. [16] Duhem proposes his model to account for the irreversibility in the modifications of equilibria observed experimentally in magnetic hysteresis [16, Chap. IV], sulfur [17], red phosphorus [19, Chap. III], and in different processes of metallurgy [19].
Preisach [56] uses “plausible hypotheses concerning the physical mechanisms of magnetization” [49, p. 1] to elaborate a model of magnetic hysteresis. This model is also proposed and studied by Everett and co-workers [24, 25, 26, 27] who postulate “that hysteresis is to be attributed in general to the existence in a system of a very large number of independent domains, at least some of which can exhibit metastability.” [24, p. 753].
Krasnosel’skiǐ and Pokrovskiǐ point out to the issue of admissible inputs, as “it is by no means clear a priori for any concrete transducer with hysteresis, how to choose the relevant classes of admissible inputs” [43, p. 5]. This is why they introduce the concept of vibro-correctness which allows the determination of the output of a hysteresis transducer that corresponds to any continuous input, once we know the outputs that correspond to piecewise monotone continuous inputs [43, p. 6]. The models that Krasnosel’skiǐ and Pokrovskiǐ propose (ordinary play, generalized play, hysteron) are vibro-correct, although the authors acknowledge the existence of hysteresis models that may not be vibro-correct like the Duhem model.7
Hysteresis models based on a feedback interconnection between a linear system and a static nonlinearity are proposed in Ref. [55]. The authors study “hysteresis arising from a continuum of equilibria … and hysteresis arising from isolated equilibria” [55, p 101].
A review of hysteresis models is provided in Ref. [48] and a detailed study of these (and other) models may be found in Refs. [7, 10, 14, 37], [49, 64].
In light of what has been said, the diversity of hysteresis models is due to the wide range of areas to which hysteresis is concomitant, and the diversity of methods and assumptions underlying the elaboration of these models.
Note that all mathematical models of hysteresis share a common property: they model hysteresis. This fact leads us to our next question.
What is hysteresis? A description found in many papers is that hysteresis “refers to the systems that have memory, where the effects of input to the system are experienced with a certain delay in time.” [33, p. 210]. This description is misleading as it applies also to dynamic linear systems. Indeed, when the output y is related to the input u by ˙x=Ax+Bu
Mayergoyz considers hysteresis as a rate-independent phenomenon which is “consistent with existing experimental facts.” [49, p. 16]. However, “for very fast input variations, time effects become important and the given definition of rate-independent hysteresis fails.” [49, p. 16]. Also, “in the existing literature, hysteresis phenomenon is by and large linked with the formation of hysteresis loops (looping). This may be misleading and create the impression that looping is the essence of hysteresis. In this respect, the given definition of hysteresis emphasizes the fact that history dependent branching constitutes the essence of hysteresis, while looping is a particular case of branching.” [49, pp. 16–18].
Following Mayergoyz, “All rate-independent hysteresis nonlinearities fall into two general classifications: (a) hysteresis nonlinearities with local memories, and (b) hysteresis nonlinearities with nonlocal memories.” [49, pp. 17]. In a hysteresis with a local memory, the state or output at time t≥t0
In Ref. [54], Oh and Bernstein consider the generalized Duhem model ˙x=f(x,u)g(˙u)
In a PhD thesis advised by Bernstein [15], Drinčić considers systems of the form ˙x=f(x,u)
In Ref. [6] Bernstein states that “a hysteretic system must be multistable; conversely, a multistable system is hysteretic if increasing and decreasing input signals cause the state to be attracted to different equilibria that give rise to different outputs.” Multistability means that “the system must have multiple attracting equilibria for a constant input value” [6].
In Ref. [50], Morris presents six examples of hysteresis systems taken from the areas of electronics, biology, mechanics, and magnetics; hysteresis being understood as a “characteristic looping behavior of the input-output graph” [50, p. 1]. The author explains the qualitative behavior of these systems from the point of view of multistability. For “the differential equations used to model the Schmitt trigger, cellular signaling and a beam in a magnetic field” it is observed that “these systems, all possess, for a range of constant inputs, several stable equilibrium points.” [50, p. 13]. The author observes that the systems are rate dependent for high input rates.
For the play operator, the Preisach model and the Bouc-Wen model which are rate independent, “these models present a continuum of equilibrium points.” [50, p. 13]. These observations lead the author to conclude that “hysteresis is a phenomenon displayed by forced dynamical systems that have several equilibrium points; along with a time scale for the dynamics that is considerably faster than the time scale on which inputs vary.” [50, p. 13]. Morris proposes the following definition.
“A hysteretic system is one which has (1) multiple stable equilibrium points and (2) dynamics that are considerably faster than the time scale at which inputs are varied.” [50, p. 13].
In Ref. [35], Ikhouane considers a hysteresis operator “H
The author introduces two changes in time scale: (1) a linear one which is applied to a given input, and (2) a -possibly- nonlinear one which is the total variation of the original input. When the input is composed with the linear time-scale change, both the input and the output are re-scaled with respect to the total variation of the input, which provides a normalized input independent of the linear time-scale change, and a normalized output. Consistency is defined as being the convergence of the normalized outputs in the space L∞
Strong consistency is defined as the property that the limit of the normalized outputs, seen a parametrized curve, converges to a periodic orbit which characterizes the hysteresis loop.
The author does not propose a definition of hysteresis, but considers that consistency and strong consistency are properties of a class of hysteresis systems.
Aim of the paper The aim of the paper is to survey the research carried out on the Duhem model from the perspective of its hysteretic properties.
Organization of the paper Section 4 presents some results obtained in Ref. [43], namely the concept of vibro-correctness, sufficient conditions to ensure global solutions of the scalar rate independent Duhem model, and a study of the continuity of the model seen as an operator. Section 5 presents a definition of hysteresis proposed in Ref. [54] that uses a generalized form of Duhem’s model as a tool to get that formal definition. Section 6 presents the concepts of consistency and strong consistency introduced in Ref. [35]. The tools and notations of Ref. [35] are also used as a unifying framework to present the results of the present paper. Section 7 presents a characterization of the generalized Duhem model obtained in Ref. [51]. Section 8 summarizes the results obtained in Ref. [40] in relation with the study of the dissipativity of the Duhem model. Section 9 summarizes some results obtained in Ref. [64] in relation with the existence of a Duhem operator, its smoothness, and some generalizations of the model. Section 10 is a note that explores the minor loops of hysteresis systems with particular emphasis on the Duhem model. For ease of reference, some results on the existence and uniqueness of the solutions of differential equations are presented in Appendix 15.
To illustrate the results obtained in Sects. 4–10, and to analyze the relationships between these results, we use the scalar semilinear Duhem model as a case study. The corresponding mathematical analysis is stated in various lemmas and theorems provided in Section 11, whose proofs are given in 16–20. The relationships between the results obtained in Sects. 4–9 are commented upon in Section 12. These comments lead to the formulation of several open problems in Sect. 13 and a conjecture in Sect. 11.9.
3 Terminology and Notations
A real number x is said to be strictly positive when x>0
An ordered pair a, b is denoted (a, b) whilst the open interval {t∈R∣a<t<b}
The Lebesgue measure on R
W1,∞(R+,Rl)
L∞(R+,Rl)
C0(R+,Rl)
∀γ∈]0,∞[
limx↑a
Let U be a set and let T∈]0,∞[
4 A Summary of the Results Obtained in Ref. [43]
This section presents those results obtained in Ref. [43] that are relevant to the present paper. This is in particular the case of the concept of vibro-correctness which allows to extend the set of admissible inputs from continuously differentiable to continuous.
4.1 The Concept of Vibro-Correctness
Definition 1
In the following we analyze the consequences of vibro-correctness. Consider a sequence of inputs uk∈E(δ0,u∗)
Thus, the concept of vibro-correctness allows to extend the definition of the operator W
Another consequence of Property (ii) is the uniqueness of the solutions of (6)–(7). This means that it is not necessary to state explicitly in Property (i) that the differential equation (6)–(7) has a unique solution (this is what is done in Ref. [43]; see also [43, p. 104]).
Definition 2
[43, p. 98] If we consider only constant inputs u∗
4.2 Global Solutions of the Scalar Rate-Independent Duhem Model
4.3 Continuity of the Rate-Independent Duhem Model Seen as an Operator
For any given initial condition x0∈R
Theorem 2
[43, Theorem 29.1] The operatorZx0
sup{u∈S(t0,t2)∣‖u‖S≤a}‖Zx0(u)‖S<∞
5 A Summary of the Results Obtained in Ref. [54]
This section presents those results obtained in Ref. [54] that are relevant to the present paper. In particular, the authors of Ref. [54] propose a definition that decides whether a given generalized Duhem model is a hysteresis or not.
5.1 The Generalized Duhem Model
Assumption 1
For every (u,x0)∈W1,∞(R+,R)×Rn
From Assumption 1 we get y∈C0(R+,R)∩L∞(R+,R)
Define the operator Ho:W1,∞(R+,R)×Rn→C0(R+,R)∩L∞(R+,R)